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ABSTRACT 


Processor-sharing  models  of  time-shared  computer  systems  are  defined 
and  some  new  results  are  presented.  The  major  emphasis  of  this  research  is 
on  the  modeling  and  analysis  of  new  models  of  time-shared  computer  systems 
and  on  the  finding  of  some  fundamental  properties  which  apply  to  the  average 
nusber  of  customers  in  the  system  and  the  average  response  time  functions  for 
a  large  class  of  tine-shared  computer  systems. 

The  family  of  selfish  scheduling  algorithms  is  defined  and  the  Laplace 
transform  of  the  response  time  functions  are  obtained.  The  selfish  round 
robin  (SRR)  and  the  selfish  foreground  background  (SFB)  systems  are  given 
as  two  illustrative  examples. 

A  family  of  scheduling  algorithms  whose  performance  ranges  between  that 
of  the  RR  system  and  the  FB  system  is  constructed.  A  weighting  function  g(t) 
is  given  to  define  the  scheduling  algorithm  such  that  a  customer's  rate  of 
attaining  service  depends  on  how  much  service  time  t  he  has  already  got. 

The  average  response  time  function  for  this  family  of  systems  is  obtained. 

A  simple  relationship  between  the  time-dependent  average  number  of 
customers  in  the  system  and  the  average  response  time  functions  for  a  large 
class  of  M/M/1  systems  is  formulated.  The  result  shows  that  the  behavior  of 
one  customer  can  strongly  Influence  the  total  number  of  customers  in  the 
system. 

Finally,  some  fundamental  properties  are  established  which  apply  to  the 
average  response  time  functions  for  all  time-shared  computer  systems.  Among 
them,  tight  upper  and  lower  bounds  on  the  average  response  time  are  obtained. 
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CHAPTER  1 


INTRODUCTION 

1.1  Time-Sharing  Computer  Systems 

The  value  of  time-shared  processing  systems  as  a  means  of  pro¬ 
viding  a  processor  to  many  users  concurrently  is  well-established.  The 
rationale  for  most  time-sharing  systens  is  to  provide  fast  service  for 
customers  with  short,  highly  interactive  programs  in  order  to  facilitate 
debugging,  to  encourage  experimentation  with  improvisation  of  oonputing 
methods,  and  to  support  general  interactive  computations.  In  the  typi¬ 
cal  operation  of  such  a  system,  the  users  ocmunicate  with  the  corrputer 
by  means  of  teletype  or  similar  1/0  devices.  As  each  user  makes  a  re¬ 
quest  for  computer  processing,  he  in  effect  enters  a  queue  whose  mem¬ 
bers  are  served  in  a  way  determined  by  the  specific  scheduling  algorithm 
being  used.  Here  we  define  a  scheduling  algorithm  as  a  set  of  decision 
rules  determining  which  user  will  next  be  serviced  and  hew  long  he  will 
be  given  use  of  processing  facilities.  Thus  each  program  in  turn  is 
transferred  into  memory,  operated  upon  and  transferred  out.  It  is  obvi¬ 
ous,  unless  this  swapping  of  programs  can  be  done  at  no  cost  in  time  or 
the  memory  is  large  enough  so  that  no  swapping  of  programs  is  needed, 
that  this  mode  of  operation  is  less  efficient  than  the  batch  system 
where  each  request  is  run  to  completion.  The  technique  of  time-sharing, 
however,  results  in  fas tea:  average  response  time  for  the  user  with  short 
request.  This  fast  response  makes  it  appear,  to  such  a  user,  that  he  is 
the  only  one  using  the  computer. 
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Ihe  effectiveness  of  the  tine-sharinq  systems  depends  in  large 
part  on  the  efficiency  with  which  the  resources  are  allocated  to  the 
individual  users.  Thus,  considerable  attention  has  been  focused  on  the 
time  and  space  scheduling  problems  of  time-sharinq  systems  and  many  ana¬ 
lytical  results  have  been  obtained  (1)  since  the  appearance  of  the  first 
applied  paper  published  in  1964  [2] .  In  most  of  the  results,  cr.ly  one 
resource  (the  CPU)  is  to  be  shared,  where  it  is  assured  that  the  size  of 
the  train  memory  is  infinite.  There  are  two  reasons  for  this  assumption: 
in  the  past,  most  of  the  tools  used  to  analyze  time-sharing  systems  have 
been  drawn  from  queueing  theory,  and  it  is  very  difficult  to  analyze  a 
system  with  two  resources  (thus  two  queueing  structures)  which  are  not 
independent  of  each  other.  The  second  reason  is  that  modeling  of  pro¬ 
gram  behavior  and  peripheral  devices  is  very  difficult.  Of  course,  the 
assumption  of  infinite  memory  somewhat  weakens  our  models.  Hcwever,  the 
CPU  is  one  of  'he  most  irrportant  resources  in  the  oonputer  system,  as 
long  as  the  size  of  the  main  memory  is  adequate  (so  that  the  system  is 
not  memory  bound) .  The  analysis  of  single  resource  systems  gives  us  a 
good  feeling  of  how  time-sharing  systems  behave.  In  this  dissertation, 
we  concentrate  on  the  single  resource  case.  We  analyze  a  class  of  such 
systems  and  also  give  some  general  behavior  constraints. 

1.2  An  Existing  Time-Sharing  Computer  System 

In  this  section  we  describe  an  existing  time-sharing  computer 
system.  We  choose  the  Model  67  of  the  IBM  System/3  0  as  our  example. 
The  following  description  is  quoted  from  Gibson  (3] . 

The  basic  architecture  of  the  IBM  System/360  makes  it  well 
suited  to  processing  in  a  multiprogramming  and  multiprocessing  environ¬ 
ment.  The  Model  67  extends  this  basic  architecture  to  provide  the 
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additional  capabilities  ot  an  advanced  time- sharing  system. 

The  Model  67  incorporates  nul ti prograrmi ng ,  multiprocessing,  and 
nultiaccess  capabilities.  Multiaccess  allows  several  users  at  remote 
consoles  to  c  annum  cate  directly  with  the  systen  and  to  present  a  number 
of  applications  ranging  from  conversational  carpi  ling  to  desk  calculator 
functions.  Mul tiprogrami ng  is  defined  as  the  ability  to  have  several 
active  programs  reside  in  core  simultaneously.  As  soon  as  one  job  is 
finished,  or  is  held  up  by  an  I/O  request,  or  has  depicted  its  time 
allowance,  the  next  task  can  begin  immediately. 

The  dynamic  relocation  feature  built  into  the  hardware  facili¬ 
tates  nu ltiprogr arming ;  peripheral  operations  will  now  be  just  like  any 
other  tasks  in  the  memory.  Even  without  the  nultiaccess  capability, 
i.ultiprograrming  provides  much  more  efficient  utilization  of  the  com¬ 
puter's  resources  than  in  a  stacked  job  operation.  For  the  first  time, 
a  centred,  processing  unit  is  a  resource  that  can  be  allocated.  With 
multiaccessing,  where  sane  of  the  jobs  in  core  belong  to  remote  termi¬ 
nals,  the  mul tiprogr arming  capability  is  further  enhanced  as  this  en¬ 
ables  the  rapid  switching  between  jobs,  or  "time-slicing." 

The  Model  67  enables  each  processor  of  a  multiprocessor  system 
to  operate  as  a  single  processor  with  its  own  I/O  subsystem,  or  jointly 
with  other  processors  in  a  syrrmetric  multiprocessing  configuration. 

1.3  The  Mathematical  Model 

Figure  1.1  shews  a  general  feedback  queueing  model  where  the  CPU 
is  being  shared.  Incoming  jobs  are  queued  and  scheduled  for  service  in 
seme  way.  At  its  scheduled  service  time,  each  job  is  processed  for  a 
time  period  called  a  quantun.  If  during  this  quantam  the  job  is  cou¬ 
ple  ted,  that  job  departs  and  service  begins  on  the  next;  otherwise,  the 
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uncompleted  job  rejoins  the  system  of  queues  to  await  further  service. 

In  some  systems,  priorities  are  assigned  to  customers.  These  priorities 
can  be  assigned  externally  [4]  •  they  can  be  assigned  to  the  customers  as 
functions  of  their  attained  service  time  (the  amount  of  service  time  so 
far  obtained  by  a  customer)  [5] ;  or  they  can  be  assigned  as  functions  of 
their  waiting  time  (6) ,  etc.  Such  priority  queueing  systems  are  called 
preenptive  if  the  customer  in  the  service  facility  is  preempted  whenever 
there  exists  another  customer  in  the  system  who  has  higher  priority. 


CZNERAL  FEEDBACK  QUEUEING  MDEEL 
FIGUFE  1-1 

It  is  necessary  to  specify  the  arrival  and  the  service  processes 
before  any  analysis  can  be  carried  out.  Let  A(t)  denote  the  distribu¬ 
tion  function  of  the  interarrival  times  with  average  time  1/A  seconds. 
If  the  interarrival  times  are  exponentially  distributed  as 

A(t)  ■  P (interarrival  time  <  t]  =  1  -  e  t  >  0  d»l) 

then  the  arrival  process  is  called  Markovian  (Poisson) .  Otherwise,  it 
is  called  general.  Also  let  us  use  B(x)  to  denote  the  distribution 
function  of  the  service  times  with  mean  request  equal  to  1/p  seconds. 
If  the  service  times  are  exponentially  distributed  as 
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x  >  0 


(1.2) 


B(x)  =  P (service  time  <  x]  -  1  -  e 

then  the  service  process  is  referred  to  as  Markovian  (exponential)  to 
differentiate  it  from  the  general  case. 

Usually,  two  letters  and  a  number  are  used  to  specify  the  ar¬ 
rival  and  service  processes  as  well  as  the  nirnber  of  servers  in  the  sys¬ 
tem.  The  first  letter  is  used  to  specify  the  arrival  process,  the  sec¬ 
ond  letter  for  the  service  process,  and  the  r.’inber  is  used  to  designate 
the  mmber  of  servers  in  the  system.  The  letter  M  is  used  for  the 
Markovian  process,  and  the  letter  G  is  used  to  represent  a  general 
process.  All  of  the  models  to  be  analyzed  in  this  dissertation  are  of 
either  fVH/1  or  M/G/l  type,  namely,  there  is  one  server  (CPU)  in  the 
system;  the  arrival  process  is  Markovian  (Poisson) ;  and  tl'e  service 
process  is  either  exponential  or  general. 

The  utilization  factor  p,  representing  the  percentage  of  time 
that  the  system  is  busy,  is  defined  as  the  ratio  of  the  average  arrival 
rate  and  the  average  service  rate. 


p  has  to  be  smaller  than  1  so  that  the  average  work  load  offered  to  the 
processor  is  less  than  its  capacity  to  handle  such  a  load. 

Another  interesting  quantity  in  the  system  is  the  size  of  the 
quantum  which  is  defined  as  the  time  interval  allocated  to  a  customer 
when  he  enters  the  service  facility.  In  a  real  system,  the  quantum  size 
has  to  be  finite  in  order  to  get  any  work  done,  but  the  analysis  tends 
to  be  difficult  and  the  results  tend  to  be  in  ocrplex  form  under  this 
assunption  (1] .  In  1967,  the  notion  of  allowing  the  quantum  to  shrink 
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to  zero  vas  first  studied  [4]  and  is  referred  to  as  "processor-sharing." 
As  the  name  implies,  this  zero-quantum  limit  provides  a  share  or  portion 
of  the  processing  unit  to  many  customers  simultaneously.  Under  the 
assumption  of  processor-sharing,  the  difficulty  in  analysis  disappears 
in  large  part  and  the  results  tend  to  be  in  sinpler  form.  Of  course, 
this  assumption  of  infinitessimal  quantum  can  never  be  reached  in  prac¬ 
tice  due  to  the  consideration  of  overhead  time;  nevertheless,  it  usually 
cam  serve  as  a  good  approximation  of  the  actual  systans. 
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CHAPTER  2 


QUEUEING  THEORY  TOOLS  AND  SUMMARY  OF  ANALYTIC  RESULTS 
FOR  TIME-SHARED  SYSTEMS 

2.1  Queueing  Tlieory  Tbols 

Queues  were  first  studied  systematically  by  Erlang  [7] .  Others 
who  have  made  key  contributions  to  the  mathematical  theory  of  queues  cure 
Pollaczeck  [8,9],  Kolcmogorov  [10],  Kendall  [11,12],  Lindley  [13],  and 
Takacs  [14-17] ,  to  mention  a  few.  Mathematical  models  of  time-sharing 
systems  cure  stochastic  in  nature  and  their  analysis  thus  draws  heavily 
on  queueing  theory  results.  In  this  section,  we  present  sane  of  the 
queueing  theory  results  that  will  be  used  later  in  this  dissertation  for 
the  analysis  of  our  mathematical  models. 

2.1.1  Little's  Result  [18] 

Let  n  denote  the  expected  nunber  of  customers  in  a  queueing 
system  and  T  the  expected  time  that  they  spend  in  the  system.  Assume 
that  the  average  rate  of  arrival  is  X.  Refer  to  Figure  2-1. 


X 


n,T 


A  GENERAL  QUEUEING  SYSTEM 
FIGURE  2-1 

We  assume  that  the  box  is  "conservative"  in  the  sense  that  customers  are 
neither  created  nor  destroyed  nor  on  the  average  accumulated  within  that 
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system,  and  so  clearly,  the  average  departure  rate  must  be  A.  Under 
the  constraint  that  the  stochastic  process  involved  is  erogodic  [19]  # 
Little  [18]  proved  that  the  following  relationship  is  always  true. 

n  =  AT  (2.1) 


2.1.2  Manoryless  Property  of  the  Markovian  Process  [19] 

As  the  nane  indicates,  the  past  history  of  a  random  variable 
which  is  distributed  exponentially  in  no  affects  its  future.  Hie  fol¬ 
lowing  equation  expresses  this  property  for  a  random  variable  T. 

P[T  <  t  +  tQ|T  >  tQ]  «  1  -  e~Xt  «  P[T  <  t]  t  >  0  (2.2) 

Hie  distribution  of  time  until  the  next  event  (say,  an  arrived.)  occurs 
given  that  tg  seconds  have  elapsed  since  the  occurrence  of  the  last 
event  is  identically  equal  to  the  distribution  of  time  until  the  next 
event  occurs  measured  from  the  time  when  the  last  event  occurred.  Hius 
the  time  a  future  event  occurs  is  independent  of  hew  long  it  has  been 
since  the  last  event  occurred.  In  other  words,  the  Markovian  process  is 
manoryless . 

2.1.3  Markovian  Process  (M/M/1)  119] 

For  an  f0Vl  system  with  infinite  queueing  room,  since  both  the 
arrival  and  the  service  process  are  Markovian,  the  all-important  memory¬ 
less  property  holds,  and  the  results  of  birth-death  processes  can  be 
applied  directly.  Far  such  systems,  the  equilibria  probability  of  hav¬ 
ing  n  customers  in  the  systen  is  given  as 

p  =  (1  -  p)pn  n=  0,1,2,...  (2.3) 

n 

where  p  is  the  utilization  factor  defined  by  Eq.  (1.3),  and  p  <  1. 
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The  expected  nunber  of  cus toners  in  the  systan  n  can  be  calculated  as 


5  = 


(2.5) 


n=0 


We  may  new  apply  Little's  result  in  order  to  obtain  the  average  time 
spent  in  the  systan  as  follows: 


T  =  -  =  iZH_ 

1  X  1  -  p 


(2.6) 


2.1.4  The  Imbedded  Markov  Chain  (H/G/l)  [12] 

For  general  service  time  distributions  the  nice  property  of 
memory  lessness  no  longer  exists,  and  the  results  of  birth -death  pro¬ 
cesses  can  no  longer  be  applied  directly  to  the  systan.  However,  when 
the  system  is  studied  at  discrete  time  points,  the  collection  of  state 
probabilities  may  constitute  a  Markov  chain.  Kendall  [12]  introduced 
the  concept  of  an  imbedded  Markov  chain  so  that  a  non-Markovian  process 
can  be  studied  by  extracting  a  set  cf  points  (called  regeneration  points) 
at  which  the  Markov  property  holds.  For  a  MAVl  system,  the  set  of 
departure  instants  from  service  is  an  extremely  convenient  set  of  regen¬ 
eration  points.  It  is  clear  that  if  we  specify  the  nunber  of  customers 
left  behind  by  a  departing  customer,  we  can  calculate  the  same  quantity 
at  some  point  in  the  future  given  only  additional  inputs  to  the  systan. 
From  the  analysis  of  the  imbedded  Markov  chain,  we  get  the  following  two 
important  results: 

A.  Pollaczek-Khinchin  Formula  [19] 

The  average  nuiber  of  queueing  customers  (those  customers 
waiting  in  the  queue)  left  behind  by  a  departing  customer  is  given  as 


2  + 

q'p  +  p 


(2.6) 
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where  C^.  is  the  coefficient  of  variation  defined  as  the  ratio  of  the 

standard  deviation  o.  of  the  service  time  distribution  to  its  mean 

o 

value. 


°b  =  I7£  =  y°b 


(2.7) 


If  we  aoply  Little’s  result  to  Eg.  (2.6) ,  we  get  the  average  time  spent 


in  the  s' ’Stem  as 


(l  +  4 

•  2d  -“py 


(2.8) 


Equation  (2.8)  is  easily  interpreted.  The  average  total  tine  spent  in 
the  systan  is  clearly  the  average  time  spent  in  the  service  plus  the 
average  time  spent  in  the  queue. 

B.  Distribution  of  Waiting  Time  [19] 

Q(z)  =  B*(X  -  Xz)  b*(X  ?&’-**■  (2'( 


where  Q(z)  is  the  z  transform  of  the  distribution  function  of  the  nun- 
ber  of  customers  in  the  queue.  Let  pn  denote  the  stationary  probabil¬ 
ity  that  there  are  n  customers  in  the  queue,  then  Q(z)  is  defined  as 

Q(z)  =  Epnzn  (2.10) 

0  n 

B*(s)  is  the  Laplace  transform  of  the  service  time  density  function 


defined  by  B*(s) 


f° e”SX<fi(x) . 


From  Eq.  (2.9)  the  Laplace  transform  S*(s)  of  the  distribution 
of  total  time  spent  in  the  M'Q/l  systan  can  be  obtained  as 


S*(s)  -B*(«)  —  W  Xb^W 


(2.11) 
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Since  the  waiting  time  in  the  queue  for  a  customer  is  independent  of  his 
own  service  time*  we  easily  get  the  Laplace  transform  of  the  waiting 
time  as 

«*<=>  -  T=&T&ST  (2'12) 

Differentiating  Eqs.  (2.11)  and  (2.12) .  various  moments  of  the  waiting 
time  and  the  system  time  can  be  derived. 

2.1.5  the  Busy  Period  [19] 

The  queueing  system  can  be  viewed  as  passing  through  alternating 
cycles  of  busy  periods  and  idle  periods  as  depicted  by  Figure  2-2. 


H - Y, - f—' V +— 


THE  UNFINISHED  WORK  AND  THE  BUSY  PERIOD 
FIGUre  2-2 

In  this  figure  we  plot 

U(t)  =  the  unfinished  work  in  the  system  at  time  t 

=  the  time  required  to  empty  the  systan  of  all  customers 
present  at  time  t. 

We  assure  that  customers  arrive  at  time  epochs  T^/Tg^T^, . . .  .  Each 
arrival  to  the  system  will  add  seme  unfinished  work  to  the  system 
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(namely,  his  service  time)  as  shown  by  Figure  2-2.  U(t)  is  sometimes 
referred  to  as  the  virtual  waiting  time  at  time  t.  Behavior  of  this 
function  is  extremely  important  in  understanding  queueing  systems  when 
one  views  them  fron  the  point  of  view  of  the  busy  period.  In  Figure 
2-2,  *1^2'  •••  represents  busy  periods,  and  I^,l2,  •••  represents 
idle  periods. 

For  M/G/l  systems  in  general,  since  the  arrival  time  distribu¬ 
tion  is  memory  less,  the  moments  of  the  idle  period  are  the  same  as  the 
moments  of  the  in  ter  arrival  time,  namely, 

F(r)  »  P [idle  period  time  <  r)  =  1  -  e~*r  r  >  0  (2.11) 

The  analysis  for  the  busy  period  distribution  is  much  more  com¬ 
plicated.  The  result  is  given  by  the  following  recursive  equation  [19] . 

P*(s)  =  B*[s  +  X  -  XP*(s)]  (2.12) 


where  P*(s)  is  defined  as  the  Laplace  transform  of  the  distribution 
f  motion  of  the  busy  period.  From  Bg.  (2.12)  the  first  two  moments  of 
the  length  of  the  busy  period  can  be  obtained  as 


% 


(2.13) 


*2 


(1  -  P)3 


(2.14) 


“7 

where  t  is  the  second  moment  of  the  service  time  distribution.  Com¬ 
paring  Eqs.  (2.13)  and  (2.6)  we  find  that  the  average  length  of  a  busy 
period  for  the  system  M/G/l  is  equal  to  the  average  time  a  customer 
spends  in  an  H/H^l  system. 
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2.2  Review  of  Some  Analytic  Results  for  Time-Shared  Systems 

In  this  section  we  wish  to  present  sene  of  the  analytic  results 
that  were  obtained  in  the  past.  They  served  as  a  point  of  departure  for 
the  research  of  this  dissertation.  The  emphasis  of  this  collection  of 
results  is  on  those  for  the  processor-sharing  models,  although  sane  of 
the  relevant  results  for  the  finite-quantun  systans  are  also  presented. 
Typically,  the  quantity  that  is  solved  for  in  a  time-sharing  system  is 
the  distribution  of  the  response  tine  which  is  defined  as  the  total  tine 
a  customer  spends  in  the  systan  conditioned  on  that  he  requests  and  gets 
t  seconds  of  service.  Mast  of  the  tine,  however,  we  can  only  solve  for 
the  average  response  tine  defined  as 

T(t)  =  E  [response  tine  for  a  customer  conditioned  on  that  he 

requests  t  seconds  of  service]  (2.15) 

Another  quantity,  W(t) ,  defined  as  the  average  amount  of  wasted  (wait¬ 
ing)  time  spent  in  the  systan,  is  also  often  used  as  a  performance  mea¬ 
sure  for  time-sharing  systems,  clearly 

W(t)  =  T(t)  -  t  (2.16) 

Swap  time  is  assumed  to  be  negligible  for  the  following  results; 
its  effect  on  the  response  tine  can  often  be  taken  into  consideration  by 
reducing  the  average  service  rate  of  the  service  facility  [20] . 

2.2.1  First-Oome-First-Served  (FCFS)  Systan 

This  systan  is  also  known  as  batch  processing.  New  comers 
always  join  the  tail  of  the  queue  (there  is  only  one  queue  in  this  sys¬ 
tan)  and  once  a  customer  enters  the  service,  he  will  be  served  until 
oorpletion.  We  can  regard  this  as  a  special  case  of  tine-sharing  sys¬ 
tans  with  infinite  quantum  size.  The  average  response  tine  for  this 
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case  with  general  service  time  distribution  is  (19  J 


T(t) 


27T-  p) 


+  t 


(2.17) 


and 


X? 

w(t)  “  KrHr  {2-18) 

T 

where  t  is  the  second  moment  of  the  service  time  distribution.  A 
very  important  characteristic  of  W(t)  is  that  it  is  independent  of  t. 
Fbr  the  system  of  exponentially  distributed  service,  Eq.  (2.18)  becomes 


2.2.2  Last-Oome-First-Served  (LCFS)  System 

In  this  system  a  newly  arrived  customer  captures  the  use  of  the 
server  until  he  leaves  completely  served  or  until  he  is  preempted  by  a 
nstfly  arriving  customer.  At  all  times,  the  customer  who  has  been  in  the 
system  for  the  least  amount  of  time  will  occupy  the  service.  No  more 
than  one  customer  can  be  in  the  service  at  any  time.  The  average  re¬ 
sponse  time  for  the  LCFS  system  is  given  by  (19J 

T(t)  “  T^~p  (2.20) 

2.2.3  Shortest  Job  First  Served  (SJF)  System 

In  this  system  the  server  selects  the  customer  in  the  queue  with 
the  shortest  required  service  time  and  serves  it  until  completion. 

This  algorithm  requires  the  knovl  adge  of  the  service  time  request  in 
advance  which  is  usually  assumed  to  be  '■^available  in  other  algorithms. 
The  average  response  time  with  general  service  distribution  is  given  as 
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121] 


7  Xt  <t 

T(t)  =  — _5i_  (2. 21) 

(i  -  xt<tr 

with  ~?<t  and  t<t  defined  by  Eq.  (2.51). 

2.2.4  Round  Rabin  (RR)  System 
A.  Finite-Quantun 

This  discrete  time  model  was  first  studied  by  Kleinrock  [22]. 
The  system  works  in  the  following  way :  arriving  customers  are  queued  in 
order  of  arrival.  The  server  selects  the  customer  at  the  head  of  the 
queue  and  services  him  for  at  most  Q  seconds,  where  Q  is  the  quantun 
sire.  If  after  Q  seconds  of  service  the  customer  needs  more,  he  is 
returned  to  the  end  of  the  same  queue.  The  service  time  of  a  newly 
arriving  customer  is  chosen  independently  from  a  geometric  distribution 
such  that  for  o  <  1, 

S  *  (1  “  0)0°  ^  n  =  1,2,3,...  (2.22) 

n 

where  Sn  is  the  probability  that  a  customer's  service  request  is 
exactly  nQ  seconds.  See  Figure  2-3. 


ptr 


TOE  DISCFETE  TIME  ROUND  ROBIN  MOEEL 
FIGUre  2-3 
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At  the  end  of  each  time  interval,  a  new  customer  arrives  to  the  system 
with  probability  Q;  thus,  the  average  arrival  rate  is  X.  There  are 
two  types  of  systers  dependent  vpon  the  order  in  thich  a  new  arrival  and 
the  ejection  of  the  customer  in  service  can  take  place  at  the  end  of 
each  quantum.  The  average  response  time  for  a  customer  needing  nQ 
seconds  in  the  early- arrival  system  is  given  by  [22 J 


nO  XQ2p  „  .  (1  -  ou) (1  -  an_1) , 

T(nQ)  -  r?p  -  *  -  11  ♦  ,,  .  a)2(1  1 

(2.23) 

where 

a  ■  o  +  XQ 

(2.24) 

-r3r 

(2.25) 

Similarly, 

seconds  of 

the  average  response  time  for  a  customer  requesting  r Q 

servioe  in  the  late-arrival  system  is  given  by  [22) 

tw»  -  - r£r  a  *  (1  -  (s  -  — i 

1  ~  p  1  P  (1  -  a)2(l  -  p) 

(2.26) 

B. 

Processor-Sharing 

As  the  quantun  siae  shrinks,  customers  get  served  at 

a 

faster  rate  but  with  less  service  each  time,  for  the  limit  case  of  zero 
quantum,  a  customer  is  required  to  make  an  infinite  muter  of  cycles, 
each  infinitely  quickly  and  each  time  receiving  infinitessimal  service, 
until  he  finally  accumulates  enou^i  service  time  (to  be  equal  to  his 
request) ,  at  which  time  he  leaves.  The  average  time  for  such  a  proces¬ 
sor-sharing  RR  system  is  given  as 

T(t)  ■  r=r  (2,27) 
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and 


W(t)  = 

1  -  p 


(2.28) 


Kleinrock  (4)  obtained  this  for  the  case  of  exponential  service  distri¬ 
bution.  Sakata  [23,24]  proved  that  it  is  also  true  for  general  service 
distribution.  Coffman,  Muntz,  and  Trotter  [25]  solved  for  the  Laplace 
transform  of  the  wed  ting  time  distribution  for  the  system  M/ty/l. 


WAITING  tim;  function  for  the  fcfs  and  rr  systems 

FIGURE  2-4 

In  Figure  2-4  we  plot  W(t)  against  t  for  the  FCFS  and  the  RR 
systems.  Roth  of  then  are  straight  lines.  Let  us  assume  that  these  two 
lines  intersect  each  other  at  the  point  t  -  t^.  This  point  t^  is  of 
great  interest  to  us  because  for  a  customer  requesting  less  than  t^ 
seconds  of  service,  then  he  has  to  wait  more  than  the  average  as  repre¬ 
sented  by  FCFS  when  he  is  in  a  RR  system,  t^  can  easily  be  calculated 
as 
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(2.29) 


pti 

r^p 


ss 


X? 

2(1  -  pT 


(2.30) 


E\Dr  exponential  service  distribution, 


? 


2 

“2 

u 


thus, 


_  1  (2.31) 

i  M 


For  IVH^I  systems,  any  customer  who  needs  more  than  the  average  service 
request  (1/p  seconds)  will  be  better  off  as  measured  by  his  average  re¬ 
sponse  time  when  he  is  in  the  FCFS  system  than  in  the  RR  system. 

2.2.5  Round  Rabin  with  Priorities  System 

In  this  model  we  assume  that  an  external  priority  assignment  is 

made  to  the  arriving  customers.  We  assure  that  there  are  P  priority 
groups  with  Poisson  arrivals,  each  at  an  average  rate  Xp  per  second 
and  an  exponentially  distributed  service  requirement  with  mean  request 
of  1/yp  seconds  for  the  p**1  group.  A  processor-sharing  model  is  as¬ 
sured.  A  positive  nurtoer  gp  which  denotes  the  relative  fraction  of 
the  processing  time  that  is  reserved  for  the  customers  from  the  P 
priority  group  is  associated  with  the  p^  priority  group,  with  larger 
values  of  gp  being  given  to  those  hi^er  priority  groups.  Tbe  average 
response  time  Tp(t)  for  a  customer  fron  the  p**1  priority  group  is 

given  by  [4] 


p  *  1,2,...  ,P 


(2.32) 
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where 


(2.33) 


E 

P  =  £p4  (2.34) 

i=l  1 


2.2.6  Selfish  Round  Rabin  (SRR*  System 

This  system  was  introduced  by  Kleinrock  (26)  for  the  processor¬ 
sharing  model  only.  He  solved  for  the  mean  response  time  for  the  case 
of  exponential  service  time  distribution.  'Ihe  algorithm  works  in  the 
following  way:  Ftor  each  customer  in  the  systtn,  a  tJ me-vary ing  value  of 
priority  is  assigned.  This  priority  value  begins  at  zero  upon  his  entry 
to  the  systan.  It  increases  at  a  positive  rate  a  as  long  as  he  is  not 
served;  whenever  he  is  in  the  service  facility,  his  priority  value  in¬ 


creases  at  a  positive  rate  8  where  a  >  8  >  0.  All  the  customers  in 
the  service  facility  share  it  equally  among  them  as  in  a  processor¬ 
sharing  RR  system.  Note  that  a  queueing  customer  gains  priority  at  a 
rate  greater  than  those  in  the  service.  Eventually  he  will  catch  up 
with  those  in  service  and  then  join  and  remain  with  that  group.  Since 
the  customers  in  service  cure  attempting  to  run  away  with  the  processor 
and  prevent  waiting  customers  from  joining  than  (there  attempt  is  futile, 
though) ,  this  system  is  called  selfish  round  robin. 

Ihe  average  response  time  for  the  SRR  system  is  [26] 


T(t) 


14; 

I  -  P 


+ 


(t  -  1/u) 


1  -  P(1  -  8/a) 


(2.35) 


and  also  the  average  waiting  time  is  given  by 
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(2.36) 


Hie  ratio  8/a  provides  one  degree  of  freedom  which  can  be  ad¬ 
justed  over  a  continuum  of  system  behaviors  ranging  from  the  PCFS 
(0/a  =  1)  to  the  RR  system  (0/a  =  0) .  Another  important  property  of 
the  SRR  systems  is  that  a  job  with  average  service  requirement  will  re¬ 
ceive  the  same  response  in  all  of  these  SRR  systems.  Please  refer  to 
Chapter  3  for  more  details. 


2.2.7 


FEEDBACKS 


DISCRETE  TBC  roREGRDUND-BACKGROUTO  MDDEL 
FIGURE  2-5 
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We  assume  that  there  are  N  levels  of  queues  in  a  T3  system  as 
depicted  by  Figure  2-5.  Arriving  cus toners  enter  the  level  1  queue  to 
await  allocation  of  their  first  quantum,  seconds.  If  more  process¬ 

ing  time  is  needed,  they  enter  the  end  of  the  level  2  queue  to  await  the 
second  allocation,  this  time  of  Q2  seconds.  This  process  continues 
until  either  the  cus  toner  leaves  the  systam  completely  served  or  he 
enters  the  NU1  queue.  Members  of  the  Nu  1  queue  are  served  as  in  a 
RR  system.  The  server  services  a  customer  from  the  Ith  queue  only  if 
all  lower  level  queues  (I  -  1,1  -  2, . . .  ,1)  are  empty.  If  a  new  custo¬ 
mer  enters  level  1  during  execution  of  a  customer  from  a  higher  queue, 
the  current  customer  is  not  preempted  until  the  allocated  quantum  is  ex¬ 
pired.  The  average  response  time  with  =  Q2  =  ...  =  has  been 


derived  by  Coffman  and  Kleinrock  [5]  as 


T(t) 


_ P/V 

(1  -  p)(l  -  p (1  - 


p[l  .  <N-1)Q] 

1  -  p[l  - 


(K  -  1)Q  +  t 


K  >  N 


(2.37) 


T(t)  = 


(^/2)  [E^(t2)  +  Y^T2)] 

(1  -  p(l  -  e-lJKQ)]  (1  -  o(l  -  e"M(K‘1)Q] 


Dfl  -  e_lj(K“1)Q] 

+  — ^ - ■  '"  uivAn  (K  -  1)Q  +  t  1  <  K  <  N  (2.38) 

1  -  p(l  -  e  U(K~1,Q] 


where 


(K  -  1)Q  <  t  <  KQ 


(2.39) 
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(2.40) 


'K 


^(t2) 


■/ 


T2dFK(T) 


Fk(t) 


1  _  e-pT  0  <  t  <  KQ 
1  T  >  KQ 


(2.41) 


(2.42) 


Schrage  [27]  has  provided  analysis  of  this  model  with  general 
service  distribution  and  N  «  «.  In  particular,  he  solved  for  the 
Laplace  transform  of  the  response  time  distribution  under  the  assumption 
of  arbitrary  quantum  size  for  each  level.  Schener  [28]  also  contributed 
to  the  infinite-level  FB  model  by  obtaining  the  average  response  time  as 

T(t)  -  t  +  £t.  (2.43) 

i-1  1 

where  K  is  defined  by  the  following  inequality 

K_i  K 

£q,  <  t  <  £Q.  (2.44) 

i-1  1  '  i=l  1 

and  is  given  as 

t.  •  QjfT^  +  +  Q2(Ti-2  +  +  ••• 

+  (Ti_ j  +  Oi.j) +  ...  +  QAXe-‘,<i"1,®E  (2.45) 

with 

Qi  ■  i  [1  -  e_vQi]  (2.46) 
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(2.47) 


QA  =  >  +  Ei/i+ 


-PQ, 


HV1 

Qb  =  Q1(l  -  e  L)  +  £Q.(e 


Mti 
e  x) 


(2.48) 


ci =  £°i 


:“3 


(2.49) 


B.  Processor-Sharing 

Coffman  and  Kleinrock  [5]  and  Schrage  [27]  independently  derived 
the  average  response  time  for  this  case.  With  general  service  time  dis¬ 
tribution,  T(t) ,  is  given  by 


W  ♦  t 
T(t)  =  f  ,  ■■■■■ 

1  P<t 


(2.50) 


where 


xndB(x) 


.  n 

+  X  J 


dB(x) 


(2.51) 


p<t  =  *fc<t 


... 

w<t  =  HI  -  p~T 


(2.52) 


(2.53) 


Schrage  [27]  alro  obtained  the  Laplace  transform  of  the  response 
time  function  for  JVG/1  systara.  Refer  to  Chapter  3  for  more  details. 
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2.2.8  Multilevel  Processor-Sharing  Systems 

Multilevel  (Mj)  queueing  models  were  first  analyzed  by  Kleinrock 
and  Maitz  129].*  Itiey  can  be  considered  as  a  generalization  and  consol¬ 
idation  of  the  PCFS,  the  RR,  and  the  FB  systems.  In  particular,  a  set 
of  attained  service  times  {a^}  is  defined  such  that 

0-«0<>l<a2<-<aN<Vl'"  (2-54) 


The  discipline  for  a  job  when  it  has  attained  service,  t,  in  the 
interval 

3  <  j  <  3,  i  B  l;2f3f  •  •  •  i  N  ^  1  (2*55) 


will  be  denoted  as  D<.  Where  is  considered  for  ary  given  level  to 
be  either  PCFS,  FB,  or  RR.  Moreover,  between  intervals  the  jobs  axe 
treated  as  a  set  of  FB  disciplines.  The  behavior  of  the  average  condi¬ 
tional  response  time  in  any  particular  level  is  independent  of  the 
discipline  in  all  other  levels.  See  Figure  2-6. 

An  expression  for  T(t) ,  the  mean  response  time  for  jobs  with 
service  tire  t  su=h  that  a^  <  t  <  a^  i.e.,  jobs  which  reach  the 
i**1  level  queue  and  there  leave  the  system,  has  been  obtained  by 
Kleinrock  and  Muntz  as  [29] 

a.  1th  level  discipline  is  FB 


T(t) 


>7 


2(1  -  p<t> 


(2.56) 


*lWo  other  not  yet  published  papers  by  these  authors  are  "Pressor- 
Sharing  Queueing  Models  of  Mixed  Scheduling  Disciplines*;  and  lfte  Pro¬ 
cessor-Sharing  Queueing  Model  for  Time-Shared  Systems  with  Bulk  Arrivals* 
(with  E.  Hodemich) . 
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FB 
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LEVELS 


ATTAINED  SERVICE,  1 


0 


where 


INTERVALS  OF  ATTAINED  SERVICE  WITH  DISCIPLINES,  Di 

FIGURE  2-6 


b.  i  level  discipline  is  FCFS 


T(t)  = 


W<a.+t 


1  -  P 


^i-l 


(2.57) 


At 


W 


<ai 


<ai  2(1  ’  P<ai) 


,  th 


.  th  ... 


c.  i  level  discipline  is  RR 

In  this  case,  the  results  are  limited  in  the  i1"1*  interval 
to  service  time  distribution  in  which 


B(x)  =  1  -  p(x)e 


-8x 


ai-i  i  *  <  ai 


p(x)  =  P0  +  PjX  +  ...  +  pnx 


n 


(2.58) 

(2.59) 
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The  service  time  distribution  F(x)  for  this  i**1  level  is  then 


F(x) 


B(ai_l  +  x)  -  Bfa^) 


1  -  q(x)e 


-Bx 


0  <  x  <  -  a^_^ 


(2.60) 


x  >  ai  -  ai-i 


where 

aj-l 

e  ‘  P(ai_!  +  *)  n 

<J(x) - 1  -  b(Vi) — *qo  +  <J!*+  •••  +  v  (2-61) 

Except  for  the  first  level  the  average  response  time  is  given  as 
T(ai-1  +  T)  "  1  -  p  {W<ai-l  +  ai-i  +  a2(T)}  (2.62) 


where 


a2(x) 


k-  Aar 


b(y2  -  y12) [ (yx  +  p  -  Xa) (1  -  e  1  )  -  Xa  e 


"YiT  _ -(p+Yj^  yxt 


(e  -  1)] 


9  _  -(u+YJx. 

2XaY^  ^  +  P  -  Aa(l  -  e  1  X)J 


(2.63) 


W 


X(1  -  e 


-pa 


i-1 


-pa 


<ai-l 


'  ^ai-le 
P*[ 1  -  £(1  -  e"pai-l)J 


i-1* 


(2.64 


with 


a  * 


1  -  B(ai_1) 


i-1 


(2.65) 
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(2.66) 


Pi  = 


-px. 
(1  -  e  x) 


'i  -Jj- 


j  -2px. 
2pAa  +  (AaP(l  -  e  x) 


(2.67) 


and 


X1  =  ai  -  ai-l 


For  the  first  level,  T(t)  is  given  as 


T(t)  = 


1  -  P 


0  <  t  <  a. 


(2.68) 


<a. 


In  Figure  2-7  we  shew  the  behavior  of  each  of  the  three  disci¬ 
plines  for  the  system  N  =  1  with  exponential  service  distribution.  We 
also  assume  that  p  =  1,  A  =  0.75,  and  a^  =  2. 

2.2.9  Attained  Service  [30] 

A.  Finite-Quantvm 

The  attained  service  for  an  inccnpletely  serviced  customer 
is  defined  as  the  nimber  of  seconds  that  he  has  so  far  spent  in  the  ser¬ 
vice  facility.  Assume  that  there  are  p  priority  groips  in  the  system 


and  let 

yt)  = 


T  (t)  = 
P  n 


density  of  the  nurrber  of  customers  in  the  system  from 
priority  group  p  who  have  so  far  received  exactly  t 
seconds  of  service 

average  arrival  rate  of  the  p**1  priority  group 
service  distribution  of  the  customers  from  the  p**1 
priority  group 

average  response  time  for  customer  from  p^  group 

conditioned  on  that  he  has  attained  t  seconds  of 

n 

service 
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Kleinrock  then  proved  that  the  following  equation  is  true 


yv  -  y1  -  yv’fyw  ■  w  p  =  1>2'3'  ••• p  <2-69) 


B.  Proces  so  r- Shari  ng 
If  we  define 


th 


np{t)  =  average  density  of  customers  from  the  p  priority 
group  still  in  the  system  who  have  so  far  received  t 
seconds  of  service 

then  Eq.  (2.55)  can  be  modified  for  the  process-sharing  models  as  [30) 


dT  (t) 

n  (t)  =  X  [1  -  B  ( t) )  — gr —  p  =  1,2,3,  ...,  P  (2.70) 

p  p  p  at 

2.2.10  Round  Robin  with  Finite  Input  Population 

In  'die  real  world,  there  is  no  such  thing  as  an  infinite  popula¬ 
tion  on  which  most  of  the  mathematical  models  of  time-sharing  systems 
are  based.  Nevertheless ,  if  the  dependence  of  the  arrival  process  upon 
the  nunber  of  customers  in  the  system  is  negligible,  the  assumption  of 
infinite  population  usually  serves  as  a  good  approximation  to  the  real 
system.  In  some  of  the  systems,  however,  the  arrival  process  does  de¬ 
pend  on  the  number  of  customers  in  the  system  in  a  perceptible  way.  Hie 
analysis  of  such  models  with  finite  input  population  then  becomes  a 
necessity. 

Typically,  a  time-varying  model  with  finite  nunber  of  inputs  is 
modelled  as  by  Figure  2-8.  Here  we  have  M  users  which  make  demands  on 
the  time-shared  system.  The  dashed  lines  in  Figure  2-8  surround  a  feed¬ 
back  queueing  model  similar  to  the  one  used  for  infinite  population 
models.  When  a  user  (a  console)  makes  a  request  for  service  to  the  com¬ 
puter,  he  enters  the  dashed  box  and  gets  served  acaonding  to  the 
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FTGUFE  2-8 

scheduling  algorithm  in  the  system.  Vhen  his  request  is  completely 
served,  he  leaves  the  dashed  box  and  starts  to  generate  a  new  request  to 
the  CPU.  The  time  spent  by  the  user  in  generating  this  new  request 
after  the  corpletion  of  the  previous  request  is  referred  to  as  the 
"think  time."  Thus,  alternating  periods  of  thinking  and  processing  take 
plaoe. 

Scherr  [31]  considered  the  case  for  which  he  assures  exponen¬ 
tially  distributed  service  time  and  think  time,  namely 


P  [think  time  <  t]  =  1  -  e_at 

t  >  0 

(2.71) 

average  think  time  =  1/a 

(2.72) 

P  [service  time  <  x]  *  1  -  e_ljX 

x  >  0 

(2.73) 

Average  service  request  *  1/y 

(2.74) 

He  solved  for  the  average  response  time  in  the  system  without 
conditioning  that  result  on  the  service  time  required.  His  result  is 
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(2.75) 


where  pQ  is  the  probability  that  no  customer  is  in  the  queue  or  in  the 
service  facility  and  is  given  by 


(2.76) 


In  Figure  2-9  we  plot  the  norrtalized  waiting  time  yT  against 
the  number  of  input  population  M.  The  point  M*  on  Figure  2-9  is  de¬ 
fined  by  Kleinrock  [32]  as  the  saturation  mirber  because  whenever  the 


NUMBER  OF  CONSOLES,  M 


FINITE  POPULATION  PERFORMANCE  AND  SATURATION 
FIGURE  2-9 

total  number  of  consoles  exceeds  this  number,  every  add:tional  customer 
will  severely  interfere  with  the  existing  customers  as  far  as  the  aver¬ 
age  response  time  is  concerned.  M*  can  be  easily  calculated  as 

M*  =  1/u  +  l/a  m  U  ±  a 

1 7p  a  (2.77) 
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Greenberg  [33] ,  Adiri  and  A vi- Itzhak  [34],  and  Krishnamoorthi 
and  Wood  [35]  also  considered  systems  with  finite  input  population. 
2.2.11  A  Oonservation  Law 

Kleinrock  [36]  showed  that  a  conservation  law  holds  for  any 
queueing  system  with  priorities  which  satisfies  the  following  restric¬ 
tions: 

a.  there  is  a  single  server  in  the  system 

b.  arrival  process  is  Poisson,  service  process  is  arbitrary 
with  arrival  and  service  processes  independent  of  each 
other 

c.  work  can  not  be  destroyed  nor  can  it  be  created  within 
the  system 

d.  preemption  allowed  only  if  service  process  is  exponential 
and  then  preemption  mist  cause  no  losses. 

The  conservation  law  is  that  the  weighted  average  of  waiting 
times  is  a  constant  regardless  of  the  scheduling  discipline,  namely, 


(2.78) 


where  P  is  the  number  of  priority  groups  and 

A  *  average  arrived  rate  for  p^1  priority  group 
P 

Vyp  -  average  service  request  for  a  customer  from  p**1 
priority  group 

Wp  =  average  waiting  time  for  customers  from  p^  priority 


group 


Wq  =  average  amount  of  work  left  in  the  service  facility 
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found  by  an  arriving  customer  (independent  of  schedul¬ 
ing  algorithm) 

2.3  Suttnary  of  Results  in  this  Dissertation 

There  are  two  najor  topics  in  this  dissertation.  The  first 
theme  is  on  the  modeling  and  analysis  of  nw  models  of  time -shared  com¬ 
puter  systems;  the  emphasis  is  on  models  with  some  degrees  of  freedom 
which  the  system  designer  can  use  to  adjust  the  system  performance  over 
a  oontinuun  of  system  behaviors.  In  order  to  provide  those  degrees  of 
freedom  to  the  system  designer,  some  parameters  have  to  be  injected  into 
the  system.  Different  algorithms  with  this  property  are  discussed  in 
Chapters  3  and  4.  The  second  major  topic  is  the  finding  of  sane  funda¬ 
mental  properties  which  apply  to  the  average  nimber  of  customers  in  the 
systen  and  the  average  response  time  functions  for  a  large  class  time- 
shared  ccrputer  systens.  Chapters  5  and  6  are  devoted  to  the  discussion 
of  these  topics. 

Chapter  3  is  devoted  to  the  study  of  the  family  of  selfish 
scheduling  systems  in  general,  with  the  selfish  round  robin  (SRR)  and 
the  selfish  foreground-background  (SFB)  as  two  illustrative  examples. 

In  the  selfish  system,  customers  are  divided  into  two  sets:  those  in 
the  queue  box  waiting  to  be  served,  and  those  in  the  service  box  sharing 
the  service  facility  in  seme  fashion  determined  by  the  scheduling  algo¬ 
rithm  being  used.  When  a  customer  is  waiting  in  the  queue  box,  his 
priority  (a  nurfoer)  increases  at  a  positive  rate  a;  when  he  is  in  the 
service  box,  his  priority  increases  at  a  positive  rate  6.  We  oonsider 
the  case  a  >  8  >  0.  If  the  scheduling  algorithm  in  the  service  box  is 
RR  (namely,  everyone  in  the  service  box  shares  the  facility  equally 
among  thonselves) ,  the  system  is  called  selfish  round  robin  (SRR) .  If 
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only  those  customers  in  the  service  box  with  the  least  amount  of  at¬ 
tained  service  are  sharing  the  server,  then  the  system  becomes  selfish 
foreground-background  (SFB) .  The  ratio  8/a  provides  to  the  system 
designer  a  degree  of  freedom  to  control  the  system  performance.  For  the 
SRR  system,  the  average  response  cime  is  solved  for  general  service  dis¬ 
tribution,  and  the  Laplace  transform  of  the  wad  ting  time  distribution  is 
obtained  for  the  tyfyi  system.  The  Laplace  transform  of  the  waiting 
time  distribution  for  the  SFB  system  is  solved  for  fVG/1  in  general. 

In  Chapter  4  we  discuss  a  family  of  algorithms  whose  performance 
ranges  between  that  of  the  RR  system  and  the  FB  system.  Similar  to  a 
pr oces so r-shared  RR  system,  all  customers  in  the  system  share  the  serv¬ 
ice  facility  simultaneously;  but  unlike  the  RR  system,  the  customers  do 
not  share  the  facility  equally  among  themselves,  rather  their  share  of 
the  processor  varies  according  to  their  amount  of  attained  service  time. 
A  weighting  function  g(t)  =  ge~^t  is  given  to  define  the  scheduling 
algorithm  such  that  a  customer's  rate  of  attaining  service,  given  that 
he  has  attained  t  seconds  of  it,  is  directly  proportional  to  g(t) . 

The  more  service  a  customer  has  accumulated,  the  slower  he  gets  served 
in  the  service  facility.  A  customer  always  gets  some  service  even 
though  he  has  spent  a  long  time  in  the  service  facility.  Thus  this  nw 
scheduling  algorithm  shows  more  discrimination  against  long  jobs  than 
the  RR  systan,  but  less  discrimination  than  the  FB  system.  The  para¬ 
meter  g  provides  one  degree  of  freedom  to  the  system  designer.  With 
g  =  0  the  system  becomes  RR;  with  g  =  °°  it  becomes  FB.  The  average 
response  time  for  this  family  of  systems  is  obtained  in  Chapter  4. 

In  Chapter  5,  conditioned  on  the  presence  of  a  "tagged"  customer, 
we  find  a  simple  relationship  between  the  time-dependent  average  lumber 
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of  customers  in  the  system  and  the  average  r  ,'sponse  time  function  for  a 
large  class  of  systems  with  Poisson  arrival  and  exponential  service 
process-  The  result  shows  that  the  behavior  of  one  customer  can 
strongly  influence  the  total  nurber  of  customers  in  the  system  for  all 
algorithms  except  PR,  in  which  case  the  average  nunber  of  customers  in 
the  system  is  a  constant. 

From  the  results  obtained  in  Chapters  3  and  4,  as  well  as  from 
seme  published  papers  [  1] ,  we  see  that  by  slightly  modifying  the 
scheduler  of  a  time-shared  syston,  a  different  model  can  easily  be  con¬ 
structed  find  corresponding  analytical  results  can  be  obtained.  This 
process  can  go  on  find  on  with  no  end  in  sight.  Clearly,  one  is  tenpted 
to  seek  sane  order  in  these  results.  For  example,  do  there  exist  any 
invariants  in  behavior?  Can  we  bound  the  possible  range  of  performance 
regardless  of  structures?,  etc.  In  Chapter  6  we  try  to  answer  sane  of 
these  important  questions.  Fortunately,  we  are  able  to  state  a  monoton¬ 
icity  property,  a  conservation  law,  and  tight  upper  and  lewer  bounds  on 
the  system  performance  as  measured  by  average  response  time.  Exanples 
of  the  tight  bounds  are  given  for  the  exponential,  the  hyperexponential, 
the  2-stage  Erlangian,  and  the  uniform  service  time  distributions. 
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CHAPTER  3 


THE  FAMILY  OF  SELFISH  SCHEDULING  ALGORITHMS  (SSA) 

3.1  The  Mathematical  Model 

The  concept  of  selfish  scheduling  algorithms  was  first  intro¬ 
duced  by  Kleinrock  (26] .  He  solved  the  average  response  time  for  the 
selfish  round  robin  (SRR)  system.  His  work  is  extended  and  generalized 
in  this  chapter  by  obtaining  the  Laplace  transform  of  the  waiting  time 
distribution  for  fVG/1  systems. 

The  principle  behind  this  model  is  that  all  customers  in  the 
system  are  divided  into  two  groups:  those  in  a  "queue  box"  waiting  for 
service;  and  those  in  a  "service  box"  sharing  the  service  facility  in  a 
way  as  specified  by  the  specific  scheduling  algorithm  being  used  in  the 
system.  A  newcomer  always  enters  the  queue  box  where  his  priority  (a 
numerical  value)  increases  from  zero  at  a  positive  rate  a;  similarly, 
when  he  is  in  the  service  box  (he  may  be  sharing  the  service  facility  or 
he  may  be  waiting  for  his  turn  depending  upon  the  scheduler) ,  his  prior¬ 
ity  increases  at  a  positive  rate  8.  All  customers  possess  the  same  para¬ 
meters  a  and  8.  We  are  interested  in  the  region  a  >  8  >  0.  iypi- 
cally,  a  customer  enters  the  queue  box  as  soon  as  he  arrives  to  the  sys¬ 
tem,  and  starts  to  build  his  priority  with  rate  a  while  he  waits  in 
the  queue  box.  Since  a  >  6  >  0,  sooner  or  later  he  will  catch  up  with 
those  customers  in  the  service  box  and  join  them  to  share  the  service 
facility.  There  is  no  feedback  from  the  service  box  to  the  queue  box, 
although  there  may  be  feedback  within  the  service  box.  Kleinrock  (26] 

Preceding  page  blank 
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defined  a  as  the  queueing  slope  and  6  as  the  serving  slope.  If  an 
RR  scheduling  algorithm  is  being  used  in  the  service,  then  everyone  in 
the  service  box  shares  the  servioe  facility  on  an  equal  basis  and  the 
system  is  referred  to  as  selfish  round  robin  (SRR) .  If  the  server  only 
serves  those  in  the  service  box  with  the  least  amount  of  attained  ser¬ 
vice  as  in  an  IB  system,  then  the  whole  system  becomes  selfish  fore¬ 
ground-background  (SIB) .  Figure  3-1  shows  a  decomposition  of  the  sel¬ 
fish  system. 
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HXDMPOSITICN  OF  the  ssa  system 
FIGUFE  3-1 

« 

Let  us  define  the  following  quantities: 

Et  :  The  event  that  customer  needs  t  seconds  of  ser¬ 

vice 

s(t)  =  a  random  variable  representing  the  total  time  a 
customer  spends  in  the  system  conditioned  on  E^. 

w(t)  *  a  randan  variable  representing  the  total  time  a 
customer  wastes  while  waiting  in  the  system  con¬ 
ditioned  on  Efc. 

q(t)  »  a  random  variable  representing  the  time  a  cus¬ 
tomer  spends  in  the  queue  box  conditioned  on  Efc. 

y(t)  -  a  random  variable  representing  the  time  a  cus- 
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toner  spends  in  the  service  box  conditioned  on 
Ef 

v(t)  =  a  random  variable  representing  the  time  a  cus¬ 
tomer  wastes  (the  waiting  time)  in  the  service 
box  conditioned  on  Efc. 

B(t)  =  P (service  time  <  tl 

B*(s)  =  The  Laplace  transform  of  the  service  time  dis¬ 
tribution  cB(x) . 

=  f  e-SxdB(x)  (3.1) 

1/p  =  average  service  request. 

X  =  average  arrival  rate, 
p  =  x/p  *  utilization  factor  of  the  system. 

S*(t,s)  *  Hie  Laplace  transform  of  s(t,x) ,  the  equili¬ 
brium  density  function  of  s(t) . 

=  e  sxs(t,x)dx  (3.2) 

W*(t,s)  =  The  Laplace  transform  of  w(t,x) ,  the  equili¬ 
brium  tensity  function  of  w(t) . 

=  f  e’®W(t,x)dx  (3.3) 

Q*(t,s)  =  The  Laplace  transform  of  q(t,x) ,  the  equili¬ 
brium  density  function  of  q(t) . 

=  j°°  e~SXq(t,x) dx  (3.4) 

Y*(t,s)  *  The  Laplace  transform  of  y(t,x) ,  the  equili¬ 
brium  density  function  of  y(t) . 

=  |°°  e”sxy(t,x)dx  (3.5) 

V*(t,s)  =  The  Laplace  transform  of  v(t,x) ,  the  equili¬ 
brium  density  function  of  v(t) . 
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(3.6) 


-  e_SXv(t,x)  dx 
Clearly  we  have  that 

s(t)  =  q(t)  +  y(t)  (3.7) 

w(t)  =  q(t)  +  v(t)  =  s(t)-t  (3.8) 

Let  us  also  define 

T(t)  =  E  (response  time|Et] 

*  E  (s(t)] 

_  _  as*(t,s)  ,,  q» 

“  s+0  ts 

W(t)  =  E  (wasted  time  in  systen|Et] 

*  E  [W(t)J 

=  -  £im  3W*.(t>s)  =  T(t)_t  (3.10) 

W^tt)  =  second  moment  of  the  equilibrium  waiting  time 
distribution  given  Efc 


Him  3^W*(t,s) 
"  ^  3s2 


(3.11) 


o2(t) 


variance  of  the  equilibrium  waiting  time  dis¬ 


tribution 


=  W2(t)  -  W^t) 

and 

Wg(t)  =  E  (waiting  time  in  the  queue  box|Et) 

=  E  lq  ( t) ) 

Him  3Q*(t,s) 

~  “  s-+0  3s 

V(t)  =  E  (waiting  time  in  the  service  box|Et] 

*  E  (v(t)] 

Him  3V»(t,s) 
s-K)  3s 


(3.12) 


(3.13) 


(3.14) 
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since  there  is  no  feedback  fran  the  service  box  to  the  queue 
box,  and  all  the  service  is  done  in  the  service  box,  the  waiting  time  a 
customer  spends  in  the  queue  box  must  be  independent  of  his  service  re¬ 
quest.  Thus, 


q(t) 

=  q 

(3.15) 

Q*(t,s) 

=  Q*(s) 

(3.16) 

Wq(t) 

=  w 
q 

(3.17) 

3.2  The  Analysis  of  the  SSA  Systems 

By  solving  the  SSA  system,  we  mean  to  find  the  equilibriun 
waiting  time  distribution  of  the  system.  Since  we  are  unable  to  do  that 
directly,  we  first  obtain  the  Laplace  transform  of  this  distribution  and 
then  obtain  the  various  mcments  by  differentiation. 

Before  going  into  details  of  the  analysis  of  the  SSA  system,  we 
present  the  following  well-known  theorem  for  PCFS  system;  it  will  be 
used  later. 

Theorem  3.1  [19] 

The  Laplace  transform  of  the  equilibriun  density  function  of 
the  waiting  time  for  the  PCFS  system  is  given  as 

"*<*••>  =  (3-18> 

which  is  independent  of  t.  Here  p  =  A/p 

When  an  PCFS  scheduling  algorithm  is  used,  there  can  be,  at 
most,  one  customer  being  served  in  the  service  box  at  any  time  (no  pro¬ 
cessor-sharing  takes  place  in  this  case) .  The  time  a  customer  spends  in 
the  queue  box  is  independent  of  the  time  he  spends  in  the  service  box 
(t  seconds) . 
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l£t  us  look  at  the  service  box  of  the  SSA  system  as  depicted  by 
Figure  3-1.  In  order  to  solve  V*(t,s) ,  let  us  follow  a  customer,  which 
we  shall  refer  to  as  the  "tagged"  customer,  through  the  systsn  given 
that  this  customer  requires  t  seconds  of  service.  The  arrival  rate  of 
customers  to  the  service  box  conditioned  on  the  presence  of  a  tagged  cus¬ 
tomer  in  that  box  is  no  longer  A,  but  rather  some  new  average  arrived 
rate  X',  although  the  arrival  process  is  still  Poisson  [26].  Thus  (in 
conjunction  with  the  fact  that  work  can  be  done  to  a  customer  only  when 
he  is  in  the  service  box)  the  service  box  itself,  conditioned  on  the 
presence  of  a  tagged  customer,  then  becomes  a  M/G/l  system  with  average 
arrival  rate  A'  and  service  distribution  B(x).  Therefore,  V*(t,s) 
can  be  obtained  readily  from  previous  results  for  M/G/l  system  with  A ' 
replacing  A.  In  the  case  of  FCFS,  from  Eq.  (3.18) ,  we  can  write  down 

v*(t'E>  •>TBr|'sT~-'V|Jl  s  «•»> 

In  order  to  calculate  A ' ,  we  refer  to  Figure  3-2  following  Kleinrock 
[26]. 

PRIORITY 


CALCULATION  OF  THE  CONDITIONAL  ARRIVAL  RATE  TO  TOE  SERVICE  BOX 

FIGURE  3-2 
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In  this  figure,  assure  that  two  successive  customers  arrive  at  time  t^ 
and  1 2>  the  average  tire  between  tj  and  t2  is  clearly  1/A.  Let  us 
also  assure  that  these  two  customers  enter  the  service  box  at  time  t^ 
and  time  t^,  respectively,  it  is  obvious  that  the  average  distance 
between  t^  and  t^  (which  is  equal  to  1/A ')  is  larger  than  that  be¬ 
tween  t^  and  t2  because  the  customers  in  the  service  box  increased 
their  priority  at  a  rate  6  and  the  ncwccncrs  catch  up  with  than  at  a 
rate  a  as  shown  by  Figure  3-2.  In  order  to  calculate  A',  we  express 
the  vertical  offset  y  in  two  different  ways: 


y-(p)6  (3.20) 

y  -  (^,-  j)a  (3.21) 

and  so  A'  is  solved  as 

A'=A(1-|)  (3.22) 

for  convenience,  we  new  define 

p'=-  =  p(l  -  — )  (3.23) 

\i  a 


Before  we  proceed  to  find  W*(t,s)  and  Q*(s) ,  we  wish  to  es¬ 
tablish  the  independent  relation  between  q(t)  and  v(t)  for  the  SSA 
system.  That  is,  we  wish  to  prove  that  the  time  a  customer  spends  in 
the  queue  box  is  independent  of  the  time  he  spends  (or  the  time  he  wastes) 
in  the  service  box.  If  this  is  true,  we  can  find  V*(t,s)  and  Q*(s) 
independently  and  then  nultiply  than  together  to  get  W*(t,s)  (i.e., 

for  two  independent  randan  variables,  the  Laplace  transform  of  the  den¬ 
sity  function  of  their  sun  is  the  product  of  the  individual  Laplace 
transforms) .  The  task  will  be  greatly  simplified  since  V*(t,s)  is 
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available  to  us  already;  all  we  have  to  do  then  is  to  find  Q*(s)  from 


_  W*(t,s) 

Q*(s)  _  -jjg-j. 


(3.24) 


Theorem  3.2 

For  any  cus toner  requiring  t  seconds  of  service,  the  time  he 
spends  in  the  queue  box  is  independent  of  the  time  he  spends  in  the 
service  box  (or  independent  of  the  time  he  wasted  in  the  service  box 
because  t  is  not  a  randan  variable) . 

Proof:  See  Appendix  A 

By  virtue  of  Theorem  3.2,  all  we  have  to  do  now  is  to  find 
Q*(s) ,  the  Laplace  transform  of  the  probability  density  function  of  the 
waiting  time  spent  in  the  queue  box.  Before  we  proceed,  let  us  make  the 
following  observation: 


DBOQM’OSITICN  OF  THE  SSA  SYSTEM 
FIGURE  3-3 


Figure  3-3  is  a  modification  of  Figure  3-1.  Arrivals  come  into 
the  system  as  a  Poisson  stream  with  mean  arrival  rate  A.  If  the  service 
box  is  not  idle,  customers  leave  the  q  ic-ie  box  for  the  service  box  at  a 
rate  A'.  If  the  service  box  becomes  idle,  then  the  customer  with  the 
highest  priority  in  the  queue  box  (if  it  is  not  empty)  enters  inmediate- 
ly  into  the  service  box.  This  flow  of  customers  keeps  on  going  inde- 
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pendent  of  the  scheduling  algorithm  being  used  in  the  sen/ice  box. 

Fran  the  viewpoint  of  the  queue  box,  it  cbes  not  make  any  difference 
whether  an  PCFS,  an  RR,  or  any  other  scheduling  algorithm  for  this  nat¬ 
ter  is  being  used  in  the  service  box.  As  far  as  the  flow  of  cus borers 
from  the  queue  box  to  the  service  bex  is  concerned,  the  rate  is  always 
A'  if  the  service  box  is  not  idle  and  infinite  if  it  is  and  if  the 
queue  box  is  not  empty.  For  different  scheduling  algorithms,  there 
will  be  different  W*(t,s)'s  and  V*(t,s)  's,  but  their  ratio  Q*(s) 
r  ana  ins  the  same,  and  it  is  this  Q*  (s)  that  we  are  trying  to  solve 
(see  Eq.  3.24) . 

Theorem  3.3 

The  Laplace  transform  Q*(s)  of  the  density  function  of  the 
waiting  time  spent  in  the  queue  box  by  a  customer  requiring  t  seconds 
of  service  time  (actually,  it  is  independent  of  t  as  we  explained 
earlier)  is 


Q*(s) 


(1  -  p)  A'B* (s)  -  A’  +  s 
'(1  -  p')  AB*(s)  -  A  +  s 


(3.25) 


with  first  moment  equal  to 


,  _  lim  9Q*(s)  _  At2  A't2 

q  s-K)  5s  2(1  -  p)  "  2(1  -  p') 


(3.26) 


Proof:  See  Appendix  A 

By  combining  Theoran  3.2  and  Theorem  3.3,  we  get  the  following 
important  result. 

Theoran  3.4 

The  Laplace  transform  of  the  equilibrivm  waiting  time  distribu- 
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ticn  function  can  be  expressed  as 


W*(t,s)  =  Q*(s)  V*(t,s) 


(1-0) 

•  (r-'Pr 


X'B*(s)  -  X'  +  s 
XB*(s)  -  X  +  s 


V*(t,s) 


(3.27) 


and  the  mean  waiting  time  as 


W(t) 


Wg  +  V(t) 


Xt*  X'tz 

2(1  -"pT  2(1  -  o'V  +  v<t) 


(3.28) 


where 


t2dB(t) 


(3.29) 


Proof;  The  proof  is  obvious  from  the  fact  that  the  Laplace  transform 
of  the  density  function  of  the  sun  of  two  independent  random  variables 
is  just  the  product  of  the  individual  Laplace  transforms. 

3.3  The  Selfish  Round  Rabin  (SRR)  System 

We  choose  the  SRR  system  as  our  first  exanple.  Figure  3-1  is 
replotted  below  to  demonstrate  the  system  behavior.  Customers  arrive 
as  a  Poisson  stream  with  average  arrival  rate  X.  Upon  their  arrival, 
they  enter  the  queue  box  where  their  priority  will  increase  from  zero 
at  a  positive  rate  a.  After  a  customer's  priority  catches  \p  to  that 
of  those  in  the  service  box,  he  will  join  them  there  and  start  to  share 
the  service  facility.  All  customers  in  the  service  box  share  the  ser¬ 
vice  facility  equally  among  themselves  as  in  an  RR  system;  at  the  same 
time,  their  priority  increases  at  a  positive  rate  8.  The  range  of  a 
and  8  of  interest  to  us  is  when  a  >  8  >  0. 
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EEOOM’OSITCCN  OF  THE  SAA  SYSTEM 


FIGURE  3-1 


Uh fortunately  for  the  RR  system,  the  Laplace  transform  of  the 
waiting  time  distribution  has  been  solved  only  for  the  case  of  exponen¬ 
tial  service  distribution.  FOr  general  service  distributions,  only  the 
mean  waiting  times  are  available  [23,24]  as  given  by: 


(3.30) 


Since  the  service  booc  looks  like  an  RR  system  with  an  average 
arrival  rate  A ' ,  the  average  waiting  time  in  the  service  box  for  a 
customer  requiring  t  seconds  of  service  is  the  same  as  in  Eq.  (4.30) 
with  A'  replacing  A,  thus 


V<t>  - 


(3.31) 


From  Eq.  (3.26) ,  we  know  that  the  waiting  dme  in  the  queue  box 


T 

_  At  A '  t 

Wq  -  ITT  -  P)  “  ITF^p 


(3.32) 


thus,  for  M^l#  that  we  can  write  down  [from  Eq.  (3.28)] 


w(t)  =irrVpr- 


p't 
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(3.33) 


=  2(1  -"pT 


Three  exanples  are  given  to  demonstrate  the  nature  of  the  mean 
waiting  times  for  the  SRR  systems.  As  our  first  exanple,  we  choose  the 
fj/JV 1  system  (i.e. ,  the  service  times  are  exponentially  distributed) . 
For  this  case,  the  mean  waiting  time  W(t)  becomes 


(3.34) 


this  result  was  first  obtained  in  [26] . 

In  Figure  3-4,  we  plot  the  average  waiting  time  function  W(t) 
against  the  requested  service  time  t  for  different  ratios  of  6  and 
a  with  X  •=  0.75  and  p  =  1.0.  From  Figure  3-4,  as  well  as  from  Eq. 
(3.34) ,  we  observe  that  the  dependence  of  W(t)  upon  t  is  linear  for 
the  entire  family  of  SRR  systems;  and  all  the  waiting  time  functions 
intersect  at  the  same  point  (t  =  -jj) .  Thus,  the  performance  of  a  cus- 
toner  who  needs  exactly  i  seconds  of  service  time  is  the  same  that  he 
would  encounter  for  any  SRR  system.  In  Section  2.2.4,  we  had  observed 
that  correspondence  between  the  RR  system  and  the  PCFS  system;  new  we 
show  it  holds  for  the  entire  class  of  SRR  systems.  We  also  observe  that 
for  a  customer  requesting  more  than  ^  seconds  of  service,  his  waiting 
time  in  the  SRR  system  is  longer  than  that  he  would  experience  in  the 
PCFS  system;  conversely,  a  customer  who  requests  less  than  ^  seconds  of 
service  gets  better  treatment  in  the  SRR  system  than  in  the  FCFS  system. 
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AVERACT:  WATTING  TIME  FUNCTIONS  FOR  TOE  SRR  SYSTEMS  WITO 
EXPONENTIAL  SERVICE  DISTRIBUTION.  \  =  0.75,  p  =  1.0 

FIGURE  3-4 
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We  shall  shew,  through  the  next  two  exanples,  that  this  property  holds 
true  for  general  servioe  distributions  as  well,  although  the  point  of 
intersections  t.  varies  for  different  distributions.  It  can  be 

7 

easily  proved  by  referring  to  Eq.  (3.33)  that  t^  =  in  general. 

As  our  second  example,  we  choose  the  system  In  this 

system,  we  have 

=  (2p)2x  e“2pX  x  >  0  (3.35) 

with  mean  servioe  time  equal  to  1/p;  the  second  moment  of  this  distri- 
2 

but  ion  is  3/2  p  .  Figure  3-5  shows  the  behavior  of  this  system  with  p 
=  1  and  A  =  0.75.  Again,  the  response  time  curves  cross  each  other 
at  the  same  point.  It  can  be  easily  shown  that  the  point  of  intersec¬ 
tion  t^  is  at 
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t.=^=  0.75  (3.36) 

Because  the  second  moment  t  is  smaller  for  this  distribution  than  the 
exponential  distribution,  t^  is  to  the  left  of  1/p. 

In  the  third  exanple,  we  shew  the  waiting  times  for  the  H/Hj/l 
system,  where  stands  for  hyperexponential  service  distribution  with 

dB(x)  =0.5  p,e”plx  +  0.5  p-e"^  x  >  0  (3.37) 

~3x  1  2 

We  choose  p^  =  5p,  ^  =  (5/9)  p,  resulting  in  a  mean  service  time 

2 

of  1/p.  The  second  moment  of  this  distribution  is  82/25p  .  Figure 
3-6  shows  the  waiting  times  of  the  fVHj/l  system  with  p  =  1  and  A  = 
0.75.  Again,  the  waiting  times  are  linearly  proportional  to  t  and 
crossing  each  other  at  the  same  point,  only  this  time,  the  point  of 
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AVERAGE  WAITING  TIM2  FUNCTIONS  FOR  THE  SRR  SYSTEMS  WITH 
2-STACE  ERLANGIAN  SERVICE  DISTRIBUTION.  X  =  0.75,  p  =  1.0 

FIGL7FE  3-5 
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AVERAGE  WAITING  TDE  FUNCTIONS  FOR  THE  SRR  SYSTEJB  WITH 
HYPEFEXPCNENTIAL  SEFWICD  DISTRIBUTION.  \  =  0.75,  y  =  1.0 

FIGUFE  3-6 
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intersection  is  to  the  right  of  1/p  because  of  the  larger  second 
moment  of  service  time  distribution. 

As  we  mentioned  earlier,  the  Laplace  transform  of  the  waiting 
time  dis  xibution  W*{t,s)  for  the  RR  systems  has  been  solved  only  for 
the  fyfyl  case;  it  is  given  as  [25] 


W*RR(t'S) 


(1  -  p)  (1  -  pr2)  e 
(1  -  pr)2  -  p(l  -  r2) 


-A(l  -  r)t 
e~pt(l  -  pr2)/r 


(3.38) 


where  r  is  taken  as  the  smaller  of  the  two  following  expressions: 

r  =  (A+p+s  -  [(A+p+s)2  -  4pA]1/2)/2A  (3.39) 

or 


r  =  2p  (A  +  p  +  s+  [(A+p+s)2  -  4pA]1/2)_1 


(4.40 


From  Eq.  (3.25) ,  for  the  tyfyl  system,  Q*(s)  is  given  as 


.  (1  -  p)  s+p-A* 

Q  (s)  *  (1  -  p’)  s  +  p  -  X 


(3.41) 


Since  V*(t,s)  is  the  same  as  W^ftjS)  with  A'  replacing  A, 
we  can  readily  write  down  W*(t,s)  for  the  ty/^1  SRR  system  as 


W*(t,s)  =  Q*(s)  •  V*(t,s) 


(1  ~  P)  s+p-A'  _ 

(1  -  pT  s+P-A 

(1 


(1  -  p')(l  ~  P'r'2)  e~A'(1  ' 
p’r')2-  P'(l  -  r'2)e_lJt(1  “ 


r’)t 

p'r,2)/r' 


(3.42) 


where  r'  is  defined  by  Eqs.  (3.39)  and  (3.40)  with  A  replaced  by 
A'. 

By  differentiating  Eq.  (3.42),  the  first  two  moments  of  the 
waiting  time  distribution  are  obtained  as,  respectively, 
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W(t)  =  -  |s  =  0 

=  p/m  -  .pVm.  I  p*  1  t 

1  -  p  1  -  P'  1  -  P'  (3.43) 

w  (t)  =  --2w*(^s)  |s  =  0 


2p't(p  -  p1)  t  (p't)2  ,  2p't 

M(1  -  p')2(l  -  p)  (l-p1)2  y(l  -  p ') 3 


+  P.-P1)  -  JP'^-e-U  -  P ' )  Mtj 

M  (1  -  p')  (1  -  p)2  p2(l  -  p')4 


(3.44) 


In  Figure  3-7,  we  plot  the  standard  deviations  of  the  waiting 
time  versus  t  for  different  values  of  B/a.  By  catparing  Figures  3-4 
and  3-7,  we  see  that  when  t  is  small,  the  standard  deviation  t-p?vfc  to 
be  scmewhat  larger  than  the  mean  value;  and  when  t  gets  large,  the 
mean  value  tends  to  be  hitter  than  the  corresponding  standard  deviation. 
In  Figure  3-8,  we  plot  the  ratio  of  the  standard  deviation  o(t)  to 
the  average  of  the  waiting  time  against  \it  with  A  =  0.75  and  u  =  1.0. 
As  shewn  by  the  figure,  o(t)  is  mono tonical ly  nonincrMsing  with  t 
for  the  entire  family  of  the  SRR  systems.  It  can  easily  be  shown  that, 
when  t  is  large,  o(t)/W(t)  is  proportional  to  l//t.  Thus,  the  mean 
waiting  times  give  a  batter  indication  of  the  system  behavior  when  the 
requested  service  time  is  large  compared  to  1/m« 

3.4  The  Selfish  Foreground-Background  (SIB)  Systan 

The  SFB  system  is  very  similar  to  the  SRR  system  we  just  discus¬ 
sed,  the  only  difference  being  that  the  scheduling  algorithm  being  used 
in  the  service  box  is  FB  instead  of  RR.  Customers  enter  the  service  box 
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STANDARD  DEVIATIONS  OF  TOE  WAITING  TBE  FOR  THE  SRR 
SYSTEM  WITH  EXPONENTIAL  SERVICE  DISTRIBUTION.  \  =  0.75,  \i  =  1.0 

FIGUFE  3-7 
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after  they  have  experienced  sane  waiting  in  the  queue  box.  Once  a  cus¬ 
tomer  enters  the  service  box,  he  will  occupy  the  service  facility  iime- 
diately  all  by  himself  because  he  is  the  one  with  the  least  amount  of 
attained  service  in  the  service  box.  From  there  on,  this  "tagged"  unit 
sees  a  pure  FB  system  with  a  Poisson  arrival  process  at  A '  customers 
per  second  until  he  loaves  the  system  completely  served. 

The  Laplace  transform  of  the  response  time  distribution  for  FB 
systems  with  general  service  time  distributions  has  been  solved  by 
Schrage  [  27  ] .  Thus,  by  substituting  A  with  A'  in  his  results, 
we  get 

Y*(t,s)  =  H*(t,s)  (3.45) 

where 

D*(t,s)  »  B* [s  +  A'  {1  -  A*(t,s) })  (3.46) 

A*(t,s)  =  G*[t,s  +  A'  (1  -  A*(t,s) }J  (3.47) 

H*(t,s)  =  (1  -  p')  (s  +  A*  (1  -  A*(t,s) }]  (3.48) 

and  G*(t,s)  is  defined  as 


SLim  (-l)m9— (*~,S)  «  f  tmdB(t)  +  tm[l  -  B(t) ]  (3. 

s-K)  3sm  •'O 


49) 


Substitute  Eq.  (3.45)  into  Eq.  (3.27)  and  we  get 


s*(t,s)  -  fE-jS-  i  bm(s}  I  rrr  «-5°) 


From  Eq.  (3.50) ,  we  can  derive  the  first  two  moments  of  the 
waiting  time  distribution  of  the  SFB  system  as 


W(t)  =  j^Vp-y  “  37T^pT  + 


>•? 


<t 


ttP^t) 


2(1  -  p'<t) 


1  + 


(3.51) 


<t 
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W(t2) 


>'?<t  |  (>■  ?<t,2  |  t2(p'<t)2  |  vt.?<t 

3(1  -  p^t)3  (1  -  P^)4  (1  -  p;t>2  U  -  P^.)3 


(>'  ?<tl2  ,  t2<P'<t)2 


2(p  -  D')  .  ^  (P  “  P‘)  r 

V<i-p-,u-p>j  +  2m<i  %)?i  -  p')  1 


•? 


<t 


2d  -PV 


.  t‘p'<t  , 

+  i_-"pT : 1 


<t 
(3.52) 


where 


tm<fi(t)  +  tm[l  -  B(t)  ] 


(3.53) 


and 


(3.54) 


Three  exanples  are  given  to  shew  the  nature  of  the  waiting  tine 
we  just  derived.  As  the  first  exanple,  we  again  choose  the  tVH^l  sys¬ 
tem.  In  Figure  3-9,  waiting  tines  for  different  8/a  ratios  are  Jt- 
ted  against  t  with  the  assumption  that  A  =  0.75,  p  =  1.  When  8/a 
=  0,  the  system  becomes  a  pure  FB  system  with  no  waiting  in  the  queue 
box.  As  the  ratio  8/a  increases,  the  average  wait  in  the  queue  box 
also  increases  until  it  hits  the  maximum  point  when  8/a  =  1,  which 
happens  to  be  a  pure  PCFS  system.  The  curves  representing  different  8/a 
ratios  do  not  intersect  the  same  point  as  in  the  SRR  systan,  but  the 
points  of  intersections  are  relatively  close  to  each  other.  Figure  3-10 
shows  the  standard  deviations  as  plotted  against  t.  In  Figure  3-11, 
o(t)/W(t)  is  plotted  against  pt  with  \  =  0.75  and  p  *  1.  When  t 
is  small,  the  standard  deviation  tends  to  be  semswhat  larger  than  the 
mean  waiting  time,  indicating  a  rather  large  zone  for  the  mean  value  to 
vary.  When  t  gets  large,  the  ratio  c(t)/W(t)  varies  with  l//t, 
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the  standard  deviation  levels  off  faster  than  does  the  mean  value,  thus 
making  the  mean  waiting  time  a  "better"  result. 

As  our  second  example,  we  choose  the  system  iVE^l.  The  ser¬ 
vice  time  distribution  is  defined  as  Eq.  (3.35) .  Figure  3-12  shows  the 
behavior  of  this  system  with  y  =  1  and  X  =  0.75.  These  figures  are 
scnewhat  similar  to  those  for  the  M/fyl  system,  with  smaller  average 
waiting  time  in  the  queue  box.  Figure  3-13  shows  the  standard  devia¬ 
tions  as  plotted  against  t.  For  0/a  not  equal  to  zero,  the  standard 
deviation  assumes  a  rather  large  value  at  t  —  0,  then  tapers  off  a  bit 
before  going  up  again.  When  t  is  a  few  times  larger  than  the  average 
service  time  1/y,  the  mean  waiting  times  are  much  higher  than  their 
corresponding  standard  deviations,  indicating  that  relatively  anall 
range  where  the  waiting  times  can  fall. 

Once  more,  we  choose  the  M/Hy'l  system  defined  by  Eq.  (3.37)  as 
our  third  example.  With  X  ■  0.75  and  y  -  1,  mean  waiting  times  and 
standard  deviations  are  plotted  against  t  as  shewn  in  Figures  3-14 
and  3-15,  respectively.  The  average  waiting  times  in  the  queue  box  (as 
indicated  at  t  =  0  in  Figure  3-14)  are  larger  than  their  correspond¬ 
ing  terms  in  an  M/M/l  system  because  the  second  moment  of  the  hyper¬ 
exponential  service  distribution  is  larger .  Again ,  when  t  is  large 
compared  to  1/y,  the  standard  deviations  tend  to  be  less  than  their 
corresponding  mean  waiting  times  in  a  similar  manner  as  were  mentioned 
earlier  for  the  W1  and  M/E^l  systems. 

3.5  Sunmary 

In  this  chapter,  we  discussed  the  family  of  selfish  scheduling 
algorithms.  The  results  obtained  in  Section  3.2  can  be  applied  to  any 
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AVERAGE  WAITING  TIME  FTNCTICNS  FOR  THE  SFB  SYSTEMS  WITH 
EXPONENTIAL  SERVICE  DISTRIBUnON.  X  =  0.75,  \)  =  1.0 

FIGURE  3-9 
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STANDARD  DEVIATIONS  OF  THE  WAITING  TIMS  FOR  THE  SFB  SYSTEMS 
WITH  EXPONENTIAL  SERVICE  DISTRIBUTION.  X  =  0.75,  y  =  1.0 


FIGUFE  3-10 
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FIGURE  3-11 


AVERACE  WAITING  TI>E  FLNCTTCNS  FOR  THE  SFB  SYSTEMS 
WITH  2-STACE  ERLANGIAN  SERVICE  DISTR3BUITCN .  X  =  0.75,  y 
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STANDARD  DEVIATIONS  OF  THE  WAITING  TI*E  FOR  THE  ^FB  SYSTEMS 
WITH  2-STACE  ERLANGLW  SERVICE  DISTRIBUTION.  X  =  0.75,  V  =  1.0 

FIGUFE  3-13 
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AVERAGE  WAITING  TDC  FUNCTIONS  FOR  THE  SFB  SYSTEMS  WITH 
HYPEFEXPCNENTIAL  SERVICE  DISTRIBUTICN.  \  =  0.75,  \i  =  1.0 

FIGURE  3-14 
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STANDARD  DEVIATIONS  OF  THE  HATTING  TIME  FOR  THE  STB  SYSTEMS 
WITH  HYPEIEXPCNENEEAL  SERVICE  DISTRIBUTION.  \  -  0.75,  y  =  1.0 

FIGUFE  3-15 
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SSA  system.  Two  parameters,  a  and  3,  are  introduced  to  the  system  so 
that  one  degree  of  freedom  (appearing  as  the  ratio  l/°0  is  provided  to 
the  system  designer  which  he  can  use  to  adjust  the  system  performance 
as  a  function  of  service  time  over  a  continuum  of  service  behaviors . 

TVo  specific  systems,  the  selfish  round-robin  and  the  selfish 
foreground— background,  are  described  in  detail  to  demonstrate  the  nature 
of  the  results.  For  the  SRR  syst'r,  the  average  response  time  always 
varies  linearly  with  the  requested  service  time;  the  Laplace  transform 
of  the  waiting  time  distribution  is  available  only  for  the  exponential 
service  time  distribution.  When  the  SFB  algorithm  is  used  by  the  sys¬ 
tem  scheduler,  the  Laplace  transform  of  the  response  time  distribution 
is  obtained  for  H^G/l  system  in  general.  Exanples  for  exponential, 
Erlangian,  and  hyper  exponential  service  distribution  are  given  in  Sec¬ 
tions  3.3  and  3.4. 

Once  again  we  wish  to  emphasize  the  generality  of  the  results 
obtained  in  this  chapter.  Given  the  result  of  any  tyVl  system, time- 
shared  system  analysis,  the  result  for  the  corresponding  selfish  system 
can  be  readily  obtained  by  referring  to  Eq.  (3.27)  (Laplace  transform 
of  the  waiting  time)  and  Eq.  (3.28)  (the  average  waiting  time) . 
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CHAPTER  4 


A  CONTINUUM  CF  fEEDBACK  SCHEDULING  ALGORITHMS 

4.1  The  Mathematical  Model 

In  Chapter  3  we  discussed  the  family  of  selfish  scheduling  al¬ 
gorithms.  In  particular,  we  discussed  the  SRR  and  the  SFB  sustains  where 
the  performance  of  those  systems  varies  over  a  continuun  of  system  be¬ 
haviors  with  the  PCFS  system  at  one  end  of  the  continuun  (i.e.  0/a  =  1) . 
The  RR  system  lies  at  the  other  end  of  the  continuum  (i.e.  3/a  =  0)  for 
the  SRR  systems;  and  the  FB  system  lies  at  this  end  (6/a  =  0)  for  the 
family  of  SFB  systems.  In  this  chapter  we  look  into  another  family  of 
scheduling  algorithms  whose  performance  also  ranges  over  a  continuun  of 
system  behaviors. 

In  Section  2.2.5  we  discussed  the  RR  system  with  externally 

assigned  priorities.  In  such  a  system,  we  assure  that  there  are  P 

priority  grocps  each  with  Poisson  arrival  process,  and  an  exponentially 

distributed  service  request  for  customers  from  each  group.  A  positive 

nunber  q  is  associated  with  the  0th  priority  group,  with  larger 
P 

values  of  gp  being  given  to  those  higher  priority  groups.  All  gp's 
are  assumed  to  be  of  constant  values,  g  can  be  interpreted  as  the  rela- 
tive  fraction  of  the  total  service  facility  (the  CPU  time)  that  is  allo¬ 
cated  for  the  customers  from  the  pth  priority  group. 

The  model  we  introduce  in  this  chapter  is  an  extension  of  this 
Round  Robin  system  with  priorities.  The  g _'s  are  no  longer  assigned 

r 

externally,  rather  they  are  assigned  to  the  customers  acoording  to  their 

Preceding  page  blank 
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attained  services  t  (which  also  inplies  that  we  transform  the  dis¬ 
crete  priority  system  into  a  continuous  priority  system) .  In  particular, 
we  let  g(t)  vary  exponentially  with  t,  namely, 


g(t)  =  ge-gt 


(4.1) 


g(t) 


g  AS  FUNCTION  OF  ATTAINED  SERVICE  TIME 
FIGUFE  4-1 

As  shown  by  Figure  4-1  and  Eq.  (4.1) ,  all  customers  in  the  system  share 
the  service  facility  simultaneously,  and  customers  with  lesser  amounts 
of  attained  service  get  served  at  a  higher  rate  than  those  with  greater 
attained  services.  The  more  service  a  customer  has  received,  the  slower 
he  will  get  served.  Let  n(t)  denote  the  density  of  the  nunber  of  cus¬ 
tomers  in  die  system  with  t  seconds  of  attained  service  (Section 
2.2.9) .  Since  liis  rate  of  attaining  service  is  directly  proportional 
to  g(t) ,  the  fraction  f (t)  (a  random  variable)  of  the  total  service 
facility  allocated  to  a  customer  with  t  seconds  of  attained  service 
is  calculated  as 


f(t) 


q(t) 

g(t)n(t)dt 


(4.2) 
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where  c  is  a  constant  indep  endent  of  t. 

The  RR  system  is  a  special  case  of  this  family  of  algorithms. 
With  very  snail  value  of  g,  the  slope  of  the  exponential  curve  drops 
very  slowly  with  t,  and  so  the  discrimination  against  customers  with 
large  attained  services  (as  measured  by  the  relative  rates  at  which 
they  accumulate  service  time)  decreases  as  g  decreases.  As  g  goes 
to  zero,  the  exponential  curve  in  Figure  4-1  becomes  a  horizontal  (non- 
discriminatory)  line,  and  every  customer  get  treated  equally  at  all 
times  (independent  of  his  attained  service  time) .  This,  of  course,  be¬ 
comes  the  Round  Robin  systan. 

On  the  other  hand,  if  we  let  the  value  of  g  be  very  large,  the 
curvature  of  g(t)  becomes  very  steep.  This  means  that  a  customer  with 
a  slightly  lesser  amount  of  accumulated  service  time  will  be  served  at  a 
nuch  higher  rate  than  another  customer  in  the  system  with  slightly  more 
attained  service.  The  higher  the  value  of  g,  the  more  pronounaed  is 
this  discrimination  against  long  jobs.  As  g  goes  to  infinity,  the 
only  time  a  customer  can  get  into  the  service  facility  is  when  he  is  the 
customer  with  the  least  amount  of  attained  service  in  the  systan,  thus 
the  system  becomes  FB. 

The  parameter  g  can  assume  ary  value  between  0  and  infinity. 
By  varying  g,  a  degree  of  freedom  is  provided  to  the  systan  designer 
which  he  may  use  to  adjust  the  systan  performance  over  a  continuum  of 
behaviors  with  the  RR  and  the  FB  systems  serving  as  the  two  boundaries. 

4.2  The  Analysis 

We  consider  the  case  WWl  with  scheduling  algorithm  g(t)  as 
defined  above. 


71 


Theorem  4.1 


The  average  response  time 
uling  algorithm  defined  by  g(t)  = 
ing  integro-differential  equation 


T(t)  for  the  system  with  the  sched- 
ge'"9t,  is  the  solution  of  the  follow- 


(1  -  p)T(t)  -  t  +  -  p  /  T’(t)  [e9*-  +  e^-lj'^dt 

l  -  P  •/(, 


j. 

-  P  f  T'  (t)  (e9*  -  e^+ir^dT  (4.3) 

0 

Proof;  See  Appendix  B 

In  Eq.  (4.3),  if  g  =  0,  it  gives  us  the  result  of  the  HR  system. 
As  g-»0,  the  term  [e91  +  e9*"  -  1J  ^9  becomes 

$  iw"  *  -  11  ^  -  ${[1  +  gr  +  1  +  gt  - 

*  £S{<!  +  9tt  ♦  T>]1/9}  " 

■[•t+Tr 

-  e-  (t  +  T>  (4.4) 

The  limit  of  [e9^  -  e91  +  1J**^9  as  g  goes  to  zero  can  similarly  be 
calculated  as 


£im 

g-K) 


(e9t  .  egr  +  n-w/g 


^{[i  +  gt  -  i  -  gt  ♦  U 1/9  J' 

${u  +  get  -  m1*p 
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(4.5) 


-  [.«*  -  <>) 


-V 


=  e-u(t  -  t) 


Substituting  Eqs.  (4.4)  and  (4.5)  into  Eq.  (4.3),  we  get 


(1  -  p)T(t)  =  t  +  . -of  T'  (t)e_y(t  +  T)dt  ”  p  f  T*  (t)e_y(t  “  T)dx 


=  t  +  -  pe_yt  f  T' 

J-  ~  P  •'rs 


.  J"  (T)e  yTdi  -  pe-yt"/*  T'(T)eyTdt 

p  */0  •'o 


Since  fran  [36] ,  we  have 


f  T'(T)e_lJTdT  - 


1  -  P 


(4.6) 


(4.7) 


Eq.  (4.6)  becomes 

(1  -  p)T(t)  -  t  +  eyt-  p e~vtf  T'(T)eUTdT  (4.8) 


Miltiply  eyt  on  *x)th  sides  of  Eq.  (4.8)  to  yield 


(1  -  p)eytT(t)  =  teyt  +  fffi_(eyt-  1)  -  p  f  T'(T)eyTdi  (4.9) 


New  differentiate  Eq.  (4.9)  with  respect  to  t,  to  get 
(i  -  p)ytT'(t)  +  u(l  -  p)T(t)eyt 

=  (1  +  wt)eut  +  j-2-^  eyt  -  peytT'(t)  (4.10) 

Multiplying  e_yt  on  both  sides  of  Eq.  (4.10)  and  sinplifying,  we  get 
T'(t)  +  u(l  -  p)T(t)  =  +  ut  (4.11) 
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With  the  initial  condition  T(0)  =  0,  Eq.  (4.11)  is  easily  solved  as 


T(t>  -  r^-p 


(4.12) 


which,  of  course,  is  the  average  response  time  for  the  RR  system. 

On  the  other  hand,  when  g  goes  to  infinity,  for 
f  gt  gr  .  "1 

e’  +  e3  -  1 1  ,  we  must  consider  two  cases: 

Case  1)  when  x  <  t,  then  1  «  e91  «  e9^ 


cr°°l 


Case  2)  when  x  >  t,  then  .1  «  e9^  «  e9 


;[egt  +  e<F  .  .  H^]' 


(4.13) 


=  e-yT 


(4.14) 


Similarly,  the  limit  of  ^e9*"  -  e91  +  lj  as  g  goes  to  infinity 


becomes 


sy*  -  ^  -  if*  -  ssw 


X  <  t 


(4.15) 


Substituting  Eqs.  (4.13),  (4.14)  and  (4.15)  into  Eq.  (4.13),  we  get 


(1  -  p)T(t)  =  t  + 


-P/.F.  _ 

1  -  P  Jn 


T'  (x)e  ^dx 


o  f  T'(x)e”yxdx  -  p  f  ' 


T' (x)e”ytdr 


(4.16) 
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or 


(4.17) 


(1  -  p  +  2pe  lJt)T(t)  =  t  +  ~  of  T’(T)e”MTdT 

Differentiating  Eq.  (4.17)  with  respect  to  t  yields 

T'(t)(l  -  p  +  2pe_yt)  -  2Ae**ytT(t)  =  1  +  p  T’(t)e-yt  (4.18) 
Rearranging,  we  get 

T'(t)(l  -  p  +  peyt)  =  2Ae_ytT(t)  +  1  (4.19) 

With  the  initial  condition  T(0)  =  0,  the  solution  of  Eq.  (4.19)  is 


T(t) 


.  ufl  -  e-^-  Pte-1*]  + _ t. 


(1  -  p  +  pe“yt)2 


1  -  p  +  pe 


-pt 


(4.20) 


which,  of  course,  is  the  average  response  time  for  the  IB  system  (with 
exponential  service  time  distribution) . 

Unfortunately,  we  cannot  solve  Eq.  (4.2)  analytically  in  general 
except  for  the  special  cases  with  g  =  0  and  g  =  °°.  An  approximate 
solution  of  Eq.  (4.2)  with  large  values  of  c  is  presented  in  the  next 
section. 


4.3  An  Approximate  Solution 

When  g  is  large  ocrrpared  to  p,  the  term 
car.  be  approximated  by  using 


e^T+  egt-  1 


-p/g 


(1  +  e)k  =  1  +  ek  for  e  «  1 


(4.21) 


We  consider  two  cases: 

Case  1)  t  <  t 


|egt  +  e*  -  -  1)| 


-p/g 
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( 


-  e‘yt[l  -  (p/gle^e9"1  -  1)] 

Case  2)  t  >  t 

|e9t  +  egr  .  i]'^9  „  e-|JT  1 1  +  e^T(egt  -  1)  | 

*  e"MT[l  -  (P/gle^fe9*  -  1)] 

1  l-y/g 

e^~  -  e91  +  lj  can  be  approximated  as 

[e9*  -  e91  +  l]-^  =  e'vt\l  -  e^le91  -  Up9 

-  e"Mt[l  +  (p/g)e“gt(egT  -  1)] 
Substituting  Eqs.  (4.19)  to  (4.21)  into  Eq.  (4.2),  we  get 

(1  -  p)T(t)  =  t  +  -  p /V(t)  [e"yt  -(y/g)e‘(g+M)t(e9T  - 

1  ”  P  •/o 

-  p  f  T'  (t)  le”^  -  (p/g)e  ^g+^T  (e*^ 

~  P  f  T’(x)dT[e  +  (p/g)e  ^9flJ)t(e9 


Simplifying  Eq.  (4.25),  we  get 

T(t)  (1  -  p  +  2pe~yt)  «  t  +  ^^r-p  -  pf  T'  (x)e~yTdT 

+  pte9*  -  1  )f  (M/g)e_(p+9>TT'(T)dT 
Differentiating  Eq.  (4.26)  with  respect  to  t  yields 


(4.22) 

(4.23) 

(4.24) 

l)]dx 

-  l)]dx 

T  -  l)]dT 

(4.25) 

(4.26) 
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T'(t)[l  -  p  +  pe“pt  +  A/g(e* ^  -  l)e‘(p+g)t]  -  2AT(t)e~Xt 

=  1  +  Xe^  f  T'(T)e”(u+g)Tdx  (4.27) 

Fran  Eq.  (4.27),  by  setting  t  =  0,  and  recognizing  that  T(0)  =  0,  we 
get  the  initial  condition 

T* (0)  =  1  +  /*  AT'(T)e“(M+g}TdT  (4.28) 

*,0 

Multiplying  e-^  to  both  sides  of  Eq.  (4.27)  and  then  differentiating 
it  with  respect  to  t,  and  rearranging  the  result,  we  get 

T'(t) (1  -  p  +  pe‘pt  +  A/g(egt  •  l)e-(p+g)t] 

+  T'(t)  [-g  +  gp  -  gpe~pt  -  Ae“pt  -  e‘(g+p) fc(egt  -  1)A(2  +  u/g)] 

+  2A(p  +  g)e“ptT(t)  +  g  =  0  (4.29) 

with  the  initial  conditions 

T(0)  =  0 

T'(0)-1+A  f  T'(x)e'(p+g,TdT  (4.30) 

y0 

Milne's  mechod  [37]  is  used  to  solve  Eq.  (4.30).  Since  the 
initial  condition  T'(0)  is  given  in  terms  of  sore  unknown  results, 
many  iterations  are  needed  in  order  for  the  results  to  converge.  W(t) 
is  plotted  against  t  on  Figure  4-2  with  A  =  0.75  and  \i  =  1. 
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CHAPTER  5 


THE  AVERACE  NUM3ER  OF  CUSTOMERS  IN  THE  SYSTEM 

5.1  Introduction 

In  Section  2.1.3  we  discussed  the  memory  less  property  of  Mar¬ 
kovian  process.  In  particular,  the  average  nunber  of  customers  in  an 
M/M/1  system  was  given  by 

"  ■  J>n  ■  (5-1) 

where  pn  is  the  equilibrium  probability  of  having  n  customers  in 
the  system  and  is  given  as 

Pn=(l-P)pn  n  =  0,1, 2, 3,. . .  (5.2) 

Equation  (5.1)  holds  true  only  when  no  more  information  about  the  sys¬ 
tem  is  available  besides  that  the  arrival  and  service  processes  are 
Markovian.  Otherwise,  the  average  number  in  the  system  will  change 
according  to  the  additional  information.  For  exanple,  let  us  assume 
that  the  system  is  not  idle  when  the  average  is  taken;  in  other  words, 
there  is  at  least  one  customer  in  the  system,  and  ask  what  is  the 
average  nunber  of  customers  in  the  system  under  this  condition. 

Equation  (5.2)  then  becomes 

Pn=(l-P)pn"1  n-  1,2,3,...  (5.3) 

and 

p0  =  0  (5.4) 
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The  average  nmber  of  customers  in  the  system  becomes 


00 


«  x  Z»n(1  ■  p)pn 


n=0 


-I .  _e__ 

p  i  -  p 


which  is  just  the  overall  average  number  p/(l  -  p)  divided  by  p,  the 
probability  that  the  system  is  busy. 

Both  Bqs.  (5.1)  and  (5.5)  were  obtained  under  the  asunption 
that  no  specific  information  about  any  single  customer  was  available; 
all  customers  were  assumed  to  be  identically  distributed  as  far  as 
their  inter  arrival  and  service  times  were  concerned.  In  most  of  the 
time- sharing  models,  however,  the  quantity  that  is  solved  for  is  the 
average  response  time  in  the  system  conditioned  on  a  tagged  customer 
needing  exactly  t  seconds  of  service  time.  In  those  systems,  one  of 
the  customers  in  the  system  [the  tagged  customer]  behaves  differently 
from  others.  His  sendee  time  request  becomes  deterministic  and  is  no 
lonqer  drawn  from  an  exponential  distribution  as  are  all  other  service 
times. 

Given  this  additional  information  (that  one  customer  will  not 
leave  the  system  before  he  has  obtained  exactly  t  seconds  of  servioo) , 
one  would  expect  that  the  average  mrtoer  of  customers  in  the  system  will 
change  as  a  function  of  this  t.  In  fact  it  does  as  is  shown  by  the 
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following  theorem  in  a  very  sinple  way. 

5.2  The  Analytical  Results 
Theoran  5.1 

The  average  number  of  customers  m(t)  in  an  system 

given  that  one  (the  tagged)  customer  has  attained  t  seconds  of  service 
is  given  by 

m(t)  =  j-^-p  +  *T(t)  ’  pW(t)  (5.6) 

where  T(t)  and  W(t)  are  the  average  response  and  waiting  time, 
respectively,  for  the  tagged  customer. 

Proofs 


l~ 

4- 

T0 


T(t) 


- 1 - 

Tq  ♦  T(t) 


TIME 


THE  PEAL  TDC  AXIS 
FIGUFE  5-1 

The  proof  of  this  theorem  is  very  sinple.  Let  us  look  at  the 
real  time  axis  as  shown  in  Figure  5-1.  Let  the  tagged  customer  arrive 
at  an  arbitrary  time  instant  TQ  .  Since  no  er'  information  about 
the  system  is  available,  there  are  on  the  average  1/(1  -  p)  customers  in 
the  system  (including  this  just  arrived  tagged  customer)  at  Tg.  At 
time  [Tg  +  T(t)  ] ,  the  tagged  customer  has  spent  T(t)  seconds  in  the 
system  and  thus  has  been  in  the  service  facility  for  an  average  of  t 
seconds.  During  the  interval  [Tg  Tg  +  T(t)  ] ,  on  the.  average,  there 
are  \T(t)  [Poisson  arrival]  new  customers  arriving  to  the  system. 
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There  are  p(T(t)  -  t]  customers  leaving  the  system  because  the  death 
rate  becomes  zero  when  the  tagged  customer  is  served  (no  customer  can 
leave  the  system  when  the  tagged  customer  is  in  the  service  facility) . 
The  average  number  of  the  customers  m(t)  in  the  system  at  Tq  +  T(t) 
is  then  calculated  as 

m(t)  =  +  AT(t)  "  VJ[’r(t)  " 

=  +  *T(t)  -  uW(t)  (5.7) 


Q.E.D. 

Equation  (5.7)  can  be  rearranged  in  one  of  the  following  two 
different  forms. 


or 


m(t)  =  T”=~p  ~  (VJ  “  A)W(t)  +  xt 


m(t)  “  T~="p  "  (lJ  "  X)T^  +  wt 


(5.8) 

(5.9) 


Since  the  average  number  of  customers  in  a  busy  system  with  no 
constraints  is  1/(1  -  p) ,  the  weighted  average  of  m(t)  for  ail  possi¬ 
ble  t  nust  be  equal  to  1/(1  ~  p) .  This  can  be  easily  shown  by  using 
the  Conservation  Law  (Eq.  (2.78)) 

y,«.  00 

m(t)dB(t)  =  f  (t-^— +  AT(t)  -  pW(t)  ]  ye-lJtdt 

0  Jo  1  p 

=  j-r-r  +  -  Vi/‘°W(t)ue"ptdt 

=  .  P/1L, 

1  ~  p  1  -  p  1  -  p 


1  -  p 


(5.10) 
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For  the  RR  system,  the  average  response  time  is  linearly  pro¬ 
portional  to  t  and  is  given  by 

-  r=T  (5-lu 

substituting  Eg.  (5.11)  into  Eq.  (5.9) ,  the  average  nuntoer  of  customers 
in  a  RR  system  given  that  a  customer  in  the  system  has  obtained  t 
seconds  of  service  time  becomes 


V0  *  rH  -  +  pt 


1  -  P 


-  pt  +  \it 


1  -  P 


(5.12) 


which  is  independent  of  t  and  always  assures  the  same  value.  As  a 
consequence  of  Eq.  (5.12) ,  we  get  the  following  theorem: 

Theorem  5.2 

Ft>r  any  scheduling  algorithm  and  its  corresponding  average 
response  time  function  T(t) ,  if  T(t)  assumes  the  same  value  as  the 
average  response  time  function  Tp^(t)  (for  the  RR  algorithm)  at 
some  t  =  t^,  then  at  this  point  t^  (that  is,  given  the  information 
that  one  of  the  customers  in  either  system  has  attained  exact  t^ 
seconds  of  service  time) ,  the  average  nunber  of  customers  in  both 
systems  equal  to  1/(1  -  p) ,  i.e.  if 


Proof: 


If  we  substitute 


TRR(t)“  1  -  p 


into  Eq.  (5.7),  then  n^t)  becomes 


"Wtt  ■ 


(5.13) 


the  average  number  of  c  us  tamers  in  a  constant  in  the  RR  system,  indepen¬ 
dent  of  t. 

Now,  if  for  soime  scheduling  algorithm  which  gives  an  average 
response  time  T(t) ,  it  intersects  with  Tp^t)  at  t  =  t^,  then 

fci 


and 


T(ti)  ”  TRR(ti}  ~  1  -  p 


w(ti>  -  VV  = 


(5.14) 


Substituting  Eq.  (5.14)  into  Eq.  (5.7),  we  get 


m(t.)  = 


Xti 


o + 1  -  p "  i  -  p  u 


i  -  p 


(5.15) 


Q.E  *D. 


5.3  Exanples 

Equation  (5.7)  is  good  for  all  fV'^l  systems,  be  it  infinite- 
quantum  or  processor-sharing.  In  this  section  we  wan,,  to  plot  the  aver¬ 
age  number  of  customers  for  different  scheduling  algorithms  to  demon¬ 
strate  the  nature  of  Eq.  (5.7) . 
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5.3.1  First-Cane-First-Served  (FCFS)  System 


Hie  average  waiting  time  is  given  by  Eq.  (2.19)  as 

W(t)  =  ^  (5.16) 

substituting  Eq.  (5.16)  into  Eq.  (5.8)  gives 

m(t)  =  p  ■ +  xt 


e  1  -  P  +  P2  +  At  (5.17) 

l-o 

m(t)  is  plotted  against  t  in  Figure  5-2  with  A  =  0.75  and 
y  s=  1.  Hie  average  nurrfoer  of  customers  increases  with  t  because  once 
the  tagged  customer  enters  the  service  facility,  he  will  occupy  the 
facility  all  by  himself  and  thus  block  all  those  who  come  to  the  system 
later  than  he  does.  Hie  nurrber  of  "blocked"  customers  increases  with 
rate  X,  and  if  t  goes  to  infinity,  so  goes  the  average  nurber  of 
customers  in  the  system. 

5.3.2  Round  Robin  (RR)  System 

As  proved  by  Theorem  5.2,  the  average  number  xr.(t)  is  always 
equal  to  1/(1  -  p)  in  the  RR  system.  It  serves  as  a  natural  reference 
algorithm  for  all  other  algorithms. 

5.3.3  Selfish  Round  Robin  (SRR)  System 

Hie  average  response  time  for  the  SRR  system  is  given  by  Eq. 
(2.35)  as 
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(5.18) 


(t  -  1/u) 

1  -  p(l  -  B/a) 


Substituting  Eq.  (5.18)  into  Eq.  (5.9),  m(t)  is  obtained  as 


P 


(u  -  X)  t 


1/u 
1  -  p 


+ 


(t  -  1/M) 

1  -  p(i  -  B/a) 


]  +  ut 


1  -  p 

1  -  p(i  -  6/r) 


+  ut 


p(BA0 

1  -  p(l  -  B/a) 


(5.19) 


m(t)  is  plotted  ageinst  t  for  the  case  of  B/a  =  0.2  in  Figure  5-2 
with  X  =  0.75  and  u  '  1.  The  behavior  of  the  SRR  system  is  again 
bounded  by  those  of  the  HR  and  the  FCFS  systems.  The  average  nurrber 
goes  to  infinity  but  at  a  slower  rate  than  that  for  the  FCFS  system 
because  the  "blocking"  effect  is  not  as  severe  in  the  SRR  system.  All 
SRR  curves  give  m(t)  =  1/(1  -  p)  for  t  =  1/u  • 


5.3.4  Foreground  Background  (FB)  System 

Substituting  Eg.  (2.50)  into  Eg.  (5.9),  m(t)  for  FB  system  is 


expressed  as 


m(t)  =  y-— -  -  (u  -  X) 


,(p/u)(l  -  e'ut  -ute~ut)  + 
(1  -P  +  pe"ut)2 


1  -p  +  pe 


+  ut 


(5.20) 

this  is  plotted  against  t  in  Figure  5-2  with  A  =  0.75  and  y  =  1. 

When  t  is  small,  the  tagged  customer  receives  service  at  a  rate  larger 
than  most  of  the  other  customers  in  the  system,  thus  causing  a  smaller 
death  rate  and  the  number  of  customers  in  the  system  increases.  As  his 
attained  service  time  accumulates ,  the  tagged  customer  constantly  loses 
priority  and  gets  served  at  decreasing  rate.  The  death  rate  of  the 
system  increases  as  t  increases,  and  there  are  fewer  and  fewer 
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customers  remaining  in  the  system.  When  t  gets  very  large,  the  only 
time  the  tagged  customer  can  get  any  service  is  when  he  is  the  only 
customer  in  the  system,  thus  the  average  nuiber  goes  down  to  1  as  t 
goes  to  infinity. 

5.3.5  Multilevel  Processor-Sharing  Models 

Let  us  use  a  two-level  mo^el  for  our  example.  FCFS  algorithm  is 
used  for  both  levels,  and  let  x  denote  the  break  point.  Containing 
Eqs.  (2.57)  and  (5.7),  m(t)  is  given  by 


-  (U  -  X)  It  ♦  + 

1  _  ~  ^  -UX 


Mt 


1  -  p  +  pe 


(u  -  X)  P--L-  +  Pt 


1  -  P  +  pe 


-MX 


t  >  x 


t  <  x 


(5.21) 


In  Figure  5-2,  m(t)  for  x  =  2  and  x  =  3  are  plotted  against  t 
with  X  =  0.75  and  p  =  1.  m(t)  increases  at  tlie  same  rate  as  in  a 
FCFS  system  when  t  is  smaller  than  x  because  the  tagged  customer  is 
the  only  one  in  the  service  facility  and  he  blocks  all  the  late  comers 
as  well  as  all  those  in  the  system  who  have  thus  far  received  more  than 
x  seconds  of  service.  Having  attained  x  seconds  of  service,  th~ 
tagged  customer  moves  to  the  lower  priority  group,  and  he  must  wait 
there  until  the  system  serves  all  those  customers  previously  blocked  by 
him  up  to  x  seconds  each  as  well  as  those  who  have  received  more  than 
x  seconds  of  service  time  and  were  preenpted  by  him  when  he  entered  the 
senrae  in  the  lower  level.  This  accounts  for  the  large  drop  of  custom¬ 
ers  at  t  =  x  .  Even  after  he  regains  control  of  the  service  facility 
at  seme  later  time,  the  tagged  customer  is  always  subjected  to  preemption 
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whenever  a  new  customer  arrives.  The  tagged  customer  still  can  block 
those  customers  who  need  more  than  x  seconds  of  servioe  after  they 
have  gained  that  iruch,  but  the  rate  of  increase  of  customers  in  the 
system  is  rather  low  if  x  is  considerably  larger  than  the  average 
service  request  1/p  . 

5.3.6  Ticftt  Upper  and  Lower  Bounds 

In  Chapter  7,  tight  upper  and  lower  bounds  as  measured  by  the 
response  time  function  are  derived  for  the  processor-sharing  models  for 
time-shared  systems.  They  are  given  by,  respectively, 


Tu(t) 


_ p/P _ +  t 

(1  -  p  +  pe"Mt)  (1  -  p)  1  -  p  +  pe"yt 


(5.22) 


Tt(t) 


p/u(l  -  e~tlt  -  pt^-|it) 


1  -  p  +  pe 


-pt 


(5.23) 


In  Eq.  (5.9) ,  T(t)  appears  only  once  and  has  a  minus  sign  in  front  cf 
it.  For  a  given  t,  if  we  substitute  the  tight  upper  bound  Tu(t)  c  f 
T(t)  into  Eq.  (5.9),  we  get  the  tight  lower  bound  of  the  nuttier  of  cus¬ 
tomers  in  the  system.  Similarly,  the  tight  upper  bound  of  m(t)  is 
obtained  by  substituting  the  tight  lower  bound  T^(t)  of  the  response 
time  functions  into  Eq.  (5.9) .  The  tight  bounds  of  the  average  number 
of  customers  in  the  system  are  plotted  against  t  in  Figure  5-  2  with 
A  =  0.75  and  p  =  1.0.  As  is  shown  by  Figure  5-2,  as  t  goes  to  infin¬ 
ity  the  lower  bound  goes  to  1;  and  the  upper  bound  approaches  the  average 
number  for  the  FCFS  system  asynpotically. 

In  Figure  5-3,  m(t)  is  plotted  against  W(t)/t  which  denotes 
how  large  a  price  (in  terms  of  wasted  time)  a  customer  must  pay  in  order 
to  get  a  unit  of  servioe  with  A  =  0.75  and  p  =  1.0  for  different 
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TIME-VARYING  AVERAGE  NUMJER  OF  CUSTONERS  PLOTIED  AGAINST  t 
FOR  DIFFEraNT  ALGORITHMS  WITO  X  =  0.75,  M  =  l 

FIGUre  5-2 
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algorithms.  For  RR,  both  m(t)  and  W(t)/t  are  constants,  it  says  that 
every  customer  pays  the  same  amount  for  a  unit  of  servi.e  time  inder  in¬ 
dent  of  his  attained  service,  and  the  average  nurber  of  customers  in  the 
system  does  not  change  with  t  or  any  function  of  t  .  For  all  other 
algorithms,  m(t)  is  larger  than  1/(1  -  p)  when  W(t)/t  is  smaller  than 
p/(l  -  p) .  The  interpretation  of  this  phenomenon  is  that  with  w(t)/t  < 
p/(l  -  p) ,  the  tagged  customer  has  been  treated  better  than  the  average 
(as  represented  by  the  RR  system)  so  that  his  occupancy  in  the  service 
facility  has  been  somewhat  longer  than  that  he  would  encounter  in  a  RR 
system.  As  a  consequence  of  this,  the  average  death  rate  (rate  of  de¬ 
partures)  of  the  customers  in  the  system  is  somewhat  lower  than  the 
average  arrival  rate  during  his  stay  in  the  system  (since  for  RR  system, 
these  two  rates  are  the  same) ,  thus  the  average  number  of  customers  in 
the  system  goes  up.  Ch  the  other  hand,  m(t)  decreases  as  W(t)/t 
goes  above  the  constant  p/(l  -  p)  since  tlie  average  departure  rate 
from  the  system  would  then  be  higher  them  the  average  arrival  rate  to 
the  system.  For  the  FB  system  (refer  to  Figure  3.9  where  W(t)  is 
plotted  against  t) ,  the  function  W(t)/t  increases  with  t  until  it 
hits  its  maximum  value  (at  t  =  5.25  for  X  =  0.75  and  p  =  1) .  it 
then  starts  to  decrease  as  t  increases  and  approaches  1/(1  -  p)  as 
t  goes  to  infinity;  at  the  same  time  the  average  number  of  customers 
in  the  system  goes  dewn  to  1  asynpotically.  This  accounts  for  the 
hook-shaped  curve  for  the  FB  system  in  Figure  5-3. 
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CHAPTER  6 


TICHT  BOUNDS  CN  THE  AVERAGE  IESPCNJE  TIME* 

6.1  Introduction 

In  the  previous  chapters  of  this  dissertation  we  discussed  a 
few  irodels  of  time-shared  ccrputer  systems.  By  slightly  changing  the 
set  of  assumptions  for  those  systems,  more  models  could  be  constructed 
and  no  re  analytical  results  could  be  obtained.  As  a  result  of  this 
flood  of  results,  it  is  natural  that  we  should  seek  sane  order.  For 
exanple,  do  there  exist  any  invariants  in  behavior?  Can  we  bound  the 
possible  range  of  performance  regardless  of  structure?  VJhat  constitute 
feasible  solutions  for  these  systems?  These,  and  many  more,  are  reason¬ 
able  inquiries  to  make  amidst  the  confusion  of  results. 

In  this  chapter  we  try  bo  answer  some  of  the  questions.  Our 
focus  is  on  a  class  of  processor-sharing  models  of  time-shared  ccrputer 
systems.  For  these  processor-shared  systems,  it  is  useful  to  display, 
in  one  figure,  the  wasted  time  W(t) .  This  we  do  in  Figure  6-1  for  the 
case  of  expontial  service  with  X  =  0.75  and  t  =  1.0  (thus  p  -  0.75) . 

We  purposely  superimpose  the  performance  curves  for  many  scheduling  dis¬ 
ciplines.  We  are  confronted  with  quite  a  selection  of  possible  perform¬ 
ance  functions!  For  these  systems  we  are  able  to  state  a  monotonicity 
property,  a  conservation  law,  and  tight  upper  and  lower  bounds  on  the 
system  performance  as  measured  by  average  response  time. 

♦These  results  were  obtained  in  collaboration  with  L.  Kleinrock  and 
R.  R.  Muntz  144] . 

Preceding  page  blank 
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A  SET  OF  IESPCNSE  CURVES  FDR  M/M/1,  V  =  1.0,  X  =  0.75 

FIGURE  6-1 
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It  is  worthwhile  mentioning  that  numerous  papers  have  recently 
been  published  which  address  themselves  to  bounds,  inequalities  and 
approximate  solutions  to  general  queueing  systems.  Among  these  are 
Marshall  [38,39],  Kingman  [40],  Iglehart  [41],  Daley  and  Moran  [42], 
and  Gaver  [43] ,  to  mention  a  few. 

6.2  The  Analytical  Results 

In  this  section  we  present  results  concerning  the  response 
functions  (W(t) )  which  are  feasible  when  the  scheduling  discipline  is 
based  only  on  attained  service  time  and  elapsed  waiting  time  of  jobs. 

In  Theorem  6.1  we  state  a  monotonicity  property  for  W(t) .  In  Theoran 
6.2  we  give  a  conservation  relationship  which  the  response  function  must 
satisfy.  In  Theorem  6.3  and  6.4  tight  la/er  and  upper  bounds  are  derived. 
As  a  result  of  Theorem  6.4,  a  necessary  condition  for  W'(t)  is  obtained 
in  Theorem  6.5. 

Theorem  6.1  W(t)  is  a  nondecreasing  function  of  t  or  equivalently 

W'(t)  =  >  0  (6.1) 

Proof:  See  Appendix  C 

Theorem  6.2  There  is  a  conservation  law  that  W(t)  has  to  satisfy, 
namely 

/°°  *.2 

W(t)  [1  -  B(t)  ]dt  =  2(1  -  '$Y  (6,2) 

For  T(t) ,  the  conservation  law  becomes 

/*°°  t-2 

^  T(t)  [1  -  B(t]dt  =  2(1  _  'py  (6.3) 

Proof:  See  Appendix  C 
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We  refer  to  Eqs.  (6.2)  and  (6.3)  as  Conservation  Laws  since  they 
are  based  on  the  conservation  of  average  unfinished  vork  in  the  system. 
Hus  places  an  integral  constraint  on  W(t)  (and  T(t) )  as  a  second 
necessary  condition,  regardless  of  scheduling  algorithm.  Hie  implica¬ 
tion  of  the  conservation  law  may  be  seen  by  recognizing  that  (1  -  B(t)  ] 
is  a  non-increasing  function  of  t.  Thus,  if  one  had  a  given  W(t)  as 
a  result  of  seme  scheduling  algorithms,  and  then  changed  the  al^>rithm 
so  as  to  reduce  W(t)  over  seme  interval  (0,tg) ,  then  the  conservation 
law  would  require  that  the  new  W(t)  be  considerably  above  the  old 
value  for  some  range  above  tg.  This  follows  since  the  weighting  factor, 
1  -  B(t),  is  smaller  for  large  t. 

With  the  help  of  Theorems  6.1  and  6.2,  we  now  proceed  to  prove 
the  main  theme  of  this  chapter. 


Theorem  6.?  The  lower  bound  W£(t)  of  waiting  time  functions  is 
given  by  the  waiting  time  for  the  PCFS  discipline  with  the  service  time 


distribution  truncated  at  t,  namely 


Mt(t>  - 


X  t2<t 

2d  -  e<t> 


(6.4) 


Note:  that  W£(0)  =  0  and  that  W£(°°)  =  W^^  (the  average  waiting  time 
for  the  PCFS  system  as  given  in  Section  2.2.1);  also  W'£(0)  =  W'^H  *  0. 


Proof:  See  Appendix  C 

Theorem  6.4  The  upper  bound  Wu(t)  of  waiting  time  functions  is 
given  as 


W„(t)  = 


? 


t*P 


<t 


u 


2(1- 


■^7 


(1  -  p>  1  -  p 


(6.5) 


<t 
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One  scheduling  discipline  which  gives  Wu(t)  is  a  two-level  systan  with 
both  levels  served  FCFS  and  switching  point  (see  Chapter  6)  at  t. 

Note  that  W^(0)  =  W£(°°)  =  W^pg,  that  VT(0)  =  0  and  that  W^(®)  = 

Proof:  See  Appendix  C 

As  a  consequence  of  Theorem  6.2  and  Theorem  6.4,  we  get  the 
following  necessary  condition  for  W'(t) . 

Theoran  6.5  For  a  response  time  function  W(t)  which  is  continuously 
differentiable,  W'(t)  =  can  not  be  monotonically  non-decreasing 

with  t. 

Proof:  See  Appendix  C 
6.3  Examples 

four  examples  are  given  in  this  section  to  danonstrate  the 
nature  of  the  tight  bounds  we  have  obtained.  As  a  performance  measure, 
the  equilibrium  average  waiting  times,  W(t) ,  are  plotted  as  a  function 
of  t.  we  begin  with  the  fyWL  system  (i.e. ,  Poisson  arrivals  and  ex¬ 
ponential  service) .  The  response  functions  of  Figure  6-1  are  given 
again  in  Figure  6-2  with  the  upper  and  lcwer  bounds  superimposed.  At 
t  =  0,  the  upper  bound  and  FCFS  start  at  the  same  point  because,  under 
the  constraint  of  the  conservation  law,  no  other  scheduling  algorithm 
can  give  longer  average  waiting  time  at  t  =  0  than  FCFS.  The  upper 
bound  approaches  the  FB  response  asymptotically  as  t  approaches  infin¬ 
ity.  Therefore,  a  customer  with  a  very  long  requested  service  time  (as 
compared  to  the  mean)  cannot  be  delayed  much  more  than  he  is  with  FB. 

The  lower  bound  starts  at  zero  (as  does  the  FB  curve) ,  increasing  less 
rapidly  with  t  than  the  upper  bound.  It  approaches  the  FCFS  curve 
asymptotically  as  t  goes  to  infinity.  Thus  we  note  that  the  least 
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discriminating  scheduling  algorithm  (PCFS)  touches  the  upper  bound  at 
t  ■  0  and  forme  the  asynptote  for  the  lower  bound  as  t  approaches  in¬ 
finity;  conversely,  the  most  discriminating  scheduling  algorithm  (FB) 
touches  the  lower  bound  at  t  ■  0  and  forms  the  asynptote  for  the 
upper  bound  as  t  approaches  infinity.  The  above-mentioned  behavior 
of  the  upper  and  lower  bounds  applies  not  only  for  the  HW1  system, 
but  also  holds  true  for  any  W/G/ 1  system  in  general,  although  the  rate 
of  convergence  for  the  bounds  to  their  respective  limits  varies  for  dif¬ 
ferent  service  distributions. 

Fbr  the  Moond  ocanple  we  choose  the  eystan  (VE/j/l*  In  thl. 
system  we  have 

(2y)2xe~2yx  x  >  0  (6.6) 

with  mean  service  time  equal  to  l/v;  the  r;eocnd  moment  of  this  distri- 
2 

bution  is  3/2 y  .  Because  the  second  moment  is  analler  than  that  of  the 

2 

exponential  distribution  (whose  value  is  2/y  ) ,  the  bounds  are  tighter 
in  this  example  than  the  HWl  case,  just  as  one  would  expect.  Figure 
6.3  shows  the  behavior  of  this  system  with  y  *  1  and  X  «  0.75.  It  is 
obvious  frcm  the  figure  that  far  t  >  5/y,  the  upper  ana  lower  bounds 
have  essentially  reached  their  asymptotic  form. 

In  the  third  exanple  we  show  the  bounds  for  the  H/H^X  system, 
where  H2  stands  for  hyperexponential  service  distribution  with 

dBfx)  "wlx 

=§^*-0.5^6  A  +  0.5y2e  x>0  (6.7) 

We  chose  ^  »  5y  ,  u2  ■  (5/9)  y  ,  resulting  in  a  mean  service  time  of 

2 

1/y*  The  second  moment  of  this  distribution  is  82/2 5y  .  Figure  6-4 
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shows  the  behavior  of  the  hVH^l  system  with  p  =  1  and  A  =  0.75.  The 
Upper  and  lower  bounds  approach  to  their  respective  limits  at  a  slower 
rate  than  either  H^l  or  because  of  the  larger  second  rroinent. 

Pot  our  last  exanple  we  choose  the  system  fVu/1  where  u  stands 
for  unifoim  service  distribution.  For  this  particular  exanple  we  have 

(0.25  2  <  x  <  6 

dB(x)  ) 

cET"  )  (6.8) 

‘  0  otherwise 

and  A  =  0.1875,  t  =  4.0,  p  =  0.75.  Figure  6-5  3 hows  the  behavior  of 
this  system.  Notice  that  when  t  >_  6 ,  the  upper  bound  coincides  exactly 
with  the  FB  curve  and  that  the  lower  bound  coincides  exactly  with  the 
PCFS  curve.  The  probability  of  having  any  customer  requesting  more  than 
six  seconds  of  service  in  this  exanple  is,  of  course,  equal  to  zero. 

Another  performance  measure,  W(t)/t,  is  given  in  Figure  6-6  for 
thefV'lVl  case  and  is  of  interest  bo  us,  since  (as  mentioned  in  Chapter 
5)  it  gives  seme  feeling  for  how  large  a  price  (in  terms  of  wasted 
time)  a  customer  must  pay  in  order  to  get  a  unit  of  service.  For  the 
case  of  RR,  this  measure  is  a  constant;  thus  each  customer  has  the  same 
penalty  rate,  regardless  of  his  service  time.  In  this  sense,  everyone 
is  treated  equally  in  the  RR  system.  The  curve  representing  FCFS  is 
monotonically  decreasing  with  t,  and  so  the  lonrrer  jobs  pay  at  a  smaller 
penalty  rate.  System  users  mic^t  then  attempt  to  "pool"  their  requests 
to  take  advantage  of  this  "quantity  discount."  Another  extreme  exanple 
is  provided  by  FB;  W(t)/t  increases  rapidly  when  t  is  small,  then 
drops  slowly  to  a  constant  (p/(l  -  p)).  A  customer  with  a  long  request 
can  do  better  by  breaking  his  job  into  analler  independent  jobs  and 
submitting  them  separately  to  the  system  (if  this  is  possible)  because 
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then  the  average  waiting  time  per  init  of  service  tire  will  be  greatly 
reduced. 

Figure  6-7  shews  the  range  of  the  bands  for  the  tyty)  system 
with  p  *  0.75,  0.5  and  0.25,  respectively.  As  can  be  seen,  the  region 
included  between  the  i^per  and  lower  bounds  for  a  particular  utilization 
factor  p  depends  heavily  an  p;  the  larger  tha  value  of  p,  the  great¬ 
er  is  the  vertical  separation  between  the  two  bounds,  thus  allowing 
larger  variation  of  the  mean  waiting  times  for  different  scheduling  al¬ 
gorithms. 
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FIGUFE  6-3 
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B0LND6  CN  lESPCNSE  FOR  li  =  1,  A  =  0.75,  p  =  0.75 
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BCXND6  CN  PENALTY  RATE  FOR  HWl,  U  =  1.0,  X  =  0.75,  p  =  0.75 


VARIATICN  OF  BOUNCE  FOR  fytyl  WITH  p  =  0.25,  0.50,  AND  0.75 

FIGLJFE  6-7 
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CHAPTER  7 


CONCLUSION  ;«ND  SUGGESTED  AREAS  FOR  FUTURE  RESEARCH 

In  Section  2.2,  we  made  u  survey  of  some  of  the  results  in 
modelling  and  analysis  of  scheduling  algorithms  that  provided  for  us 
the  starting  point  of  this  research.  In  Chapters  3  and  4,  as  an  exten¬ 
sion  of  this  line  of  work,  we  modelled  and  analyzed  the  family  of  sel¬ 
fish  scheduling  algorithms  and  a  new  family  of  algorithms  whose  perform- 
-ance  ranges  between  that  of  the  RR  and  the  FB  systems.  The  emphasis  of 
these  algorithms  is  to  introduce  parameters  into  the  models  so  that 
various  degrees  of  freedom  can  be  obtained  by  adjusting  these  para¬ 
meters.  It  is  now  possible  to  go  from  the  algorithm  (RTS)  which  shows 
no  discrimination  with  regard  to  job  length  to  that  discipline  which 
shows  maximum  discrimination  (re)  on  job  length  among  customers  in  a 
system.  We  are  able  to  shew  models  whose  performance  ranges  in  between 
these  two  extremes  on  a  continuum  basis.  In  Chapters  5  and  6,  we  ans¬ 
wered  some  of  the  fundamental  questions  regarding  the  existence  of  order 
and  structure  in  the  analytic  results  for  time-shared  corputer  systems. 
In  particular,  conditional  average  number  of  customers  in  the  system  for 
different  scheduling  algorithms  is  calculated  and  tight  upper  and  lower 
bounds  are  obtained  for  the  class  of  processor-si  laring  model  queuing 
systems. 

Since  we  have  limited  ourselves  to  the  modelling  and  analysis  of 
only  the  processor-sharing  systems  in  this  dissertation,  it  is  natural 
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then  to  ask  as  our  first  question  what  kind  of  extension  needs  to  be 
done  in  regard  to  this  area.  In  Chapter  6,  we  discussed  sene  of  the 
fundamental  properties  of  the  processor-sharing  models  regardless  of 
scheduling  algorithm;  specifically  wc  have  obtained  sane  of  the  neces¬ 
sary  conditions  that  a  given  response  function  has  to  folio/.  But  that 
is  only  part  of  che  answer;  the  question  as  to  what  are  the  necessary 
and  sufficient  conditions  for  a  given  response  function  to  be  feasible 
remains  unsolved. 

The  assumption  of  zero  quantun  size  is  somewhat  unrealistic  in 
the  real  world.  The  motivation  of  this  assumption  is  one  of  simplicity 
of  analysis  and  in  the  presentation  of  results  which  serve  as  good  ap¬ 
proximations  to  finite-quantun  models.  Also,  the  assumption  of  infinite 
population  my  be  undesirable  if  the  arrival  process  somehow  depends  an 
the  nurfoer  of  elastomers  in  the  system  [45,46] .  Clearly,  more  work  needs 
to  be  done  in  these  areas  as  well  as  the  case  for  more  general  arrival 
and  service  processes  so  that  results  that  are  of  more  sigiificance 
could  be  obtained. 

In  the  past,  most  of  the  effort  has  been  spent  on  the-  models 
with  single  resource.  It  is  true  that  the  central  processing  unit  is 
probably  the  most  irrportant  element  in  a  computer  system  and  the  schedul¬ 
ing  of  its  resources  is  vital  to  the  performance  of  such  a  system.  And 
if  the  size  of  the  memory  and  the  nurfoer  of  I/O  devices  are  adequate 
then  the  allocation  of  the  CPU  time  should  be  relatively  independent  of 
other  resources .  But  in  a  real  computer  system,  we  know  this  seldom  can 
be  attained.  The  speed  of  the  I/O  devices,  the  size  of  the  min  memory, 
the  allocation  schemes  of  the  main  memory,  the  size  of  pages  and  segments 
in  a  paged  memory  system,  program  behaviors,  as  well  as  the  data  chan- 
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nels  which  connect  the  I/O  devices  and  the  memory  all  have  sene  effect 
on  the  performance  of  a  oenputer  system.  For  exanple,  the  effect  of  a 
page  faulting  in  a  paged  memory  system  may  be  as  important  or  even  more 
important  than  the  scheduling  algorithm  in  some  cases.  If  a  system 
"thrashes"  as  described  by  Denning  147] ,  the  central  processing  unit 
would  be  idle  most  of  the  time  because  customers  page  fault  at  a  very 
high  rare,  and  it  would  then  make  little  difference  as  to  what  schedul¬ 
ing  algorithm  is  being  used  in  the  CPU.  Unfortunately,  very  little  work 
has  been  done  in  this  area  mainly  because  of  the  difficulty  in  analyzing 
such  a  queuing  system  with  two  or  more  inter-dependent  queuing  struc¬ 
tures.  However,  much  work  must  be  done  in  this  area  in  order  to  get  a 
better  understanding  of  the  behavior  of  the  time-sharing  oenputer  sys¬ 
tems. 

Another  very  important  question  that  has  not  been  answered  is 
the  one  of  optimization  where  this  term  itself  is  yet  to  be  well-defined. 
So  far  we  have  talked  about  modelling  and  analyzing  of  schedulign  al^or- 
ithns  that  usually  all  favor  the  short  job  over  the  long  one,  and  in  most 
of  the  cases  the  average  response  time  is  solved  for  as  the  performance 
measure  of  the  system.  We  may  ask  whether  this  is  the  only  valid  criter¬ 
ion  for  awarding  priority  in  a  time-sharing  system.  If  we  want  to  attract 
customers  with  long  jobs  to  a  oenputing  facility,  we  must  be  able  to 
award  high  priority  to  them  whenever  they  indicate  a  willingness  to  pay 
the  high  price.  Here  we  introduce  the  cost  of  delay  as  a  performance 
measure  and  as  a  criterion  for  optimality  for  time-shared  corputer  sys¬ 
tems.  Even  though  the  modelling  and  analysis  of  a  system  with  different 
C06t  functions  assigned  to  different  users  may  become  very  difficult,  a 
lot  of  work  is  needed  in  this  area  if  seme  criteria  of  optimization  of 
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models  with  costs  taken  from  ti»c  viewpoint  of  the  systan  and  the  users 
are  to  be  formulated. 
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THEJOFEMS  AND  PROOF'S  FOR  THEOFEMS  IN  CHAPTER  3 
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A.l 


Theorem  3.2  and  Its  Proof 


Theorem  3.2  For  any  customer  requiring  t  seconds  of  service, 
the  time  he  spends  in  the  queue  box  is  independent  of  the  time  he  spends 
in  the  service  booc  (or  independent  of  the  time  he  wasted  in  the  service 
box  because  t  is  not  a  random  variable) . 


Proof:  We  prove  this  theorem  by  using  an  argument  for  busy  period  dis¬ 
tributions  [19].  The  Laplace  transform  P*(s)  of  the  distribution  of 
the  busy  periods  for  the  fyG/1  system  is  given  as  (see  Eq.  (2.12)) 

P*(s)  *  B*  [s  +  A  -  AP*(s)  ]  (A.l) 

with  mean  value  g^  and  second  moment  g2  as  follows: 


9 


(A.  2) 


where 


(1  -  P) 


7 


77  _  lim  <?B*(s> 

1  "O-?" 


second  moment  of  service 
time  distribution 


(A.  3) 


(A.  4) 


Since  all  the  work  has  to  be  done  in  the  service  box  and  since 
the  arrival  process  to  the  service  box  when  it  is  not  idle  is  Poisson, 
the  service  box  itself  can  be  regarded  as  an  fV<Vl  system  with  average 
arrival  rate  A'.  The  Laplace  transform  P*(s)  of  the  busy  period 
distribution  can  also  be  expressed  as 

P*(s)  =  B*[s  +  A'  -  A'  P*(s)]  (A. 5) 

with  mean  value  gj^  and  second  moment  g£ 

Preceding  page  blank 
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(A.  6) 


7 


a  -  p’) 


(A.  7) 


PRIORITY 


BUSY  PERIOD  DISTRIBUTION  OF  AN  SSA  SYSTEM 
FIGUFE  A-l 


Itefer  to  Figure  A-l;  a  customer  arrives  to  an  enpty  system  at 
time  Tq  and  starts  a  busy  period  in  the  system  as  well  as  in  the  ser¬ 
vice  box  because  he  does  not  have  to  spend  any  time  in  the  queue  box. 
Fran  Tq  to  T^,  the  service  box  is  busy  as  more  customers  arrive  to  the 
system.  Suppose  that  the  last  customer  in  the  service  box  leaves  at  ; 
that  marks  the  end  of  this  "small"  busv  period.  If  the  queue  box  is  not 
enpty  at  this  time,  the  customer  with  the  highest  priority  will  be  ad¬ 
mitted  iirmediately  into  the  service  box,  thus  starting  another  small 
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busy  period.  The  priority  of  the  service  box  will  adjust  to  whatever 
priority  this  cup  toner  has;  usually  this  means  a  drop  of  priority  as 
shown  by  Figure  A-l.  If,  when  the  last  customer  in  the  service  box  de¬ 
parts  and  there  is  no  cu; toner  waiting  in  the  queue  box,  the  system  goes 
idle  and  this  means  the  end  of  a  busy  period  of  the  system.  In  order  to 
differentiate  this  with  the  small  busy  period  we  mentioned  earlier,  we 
call  this  a  large  busy  period.  In  short,  a  large  busy  period  is  the 
time  interval  when  the  system  is  busy;  and  a  small  busy  period  is  the 
time  interval  the  service  box  is  busy  (with  no  drop  in  priorities) . 
Obviously,  a  large  busy  period  us  tally  contains  one  or  more  small  busy 
periods.  For  an  ty/tyi  system  with  average  arrival  rate  A  and  service 
rate  p  ,  the  average  length  of  a  large  busy  period  is 

larly,  the  average  length  of  a  small  busy  period  is  *  Therefore, 

1  -  o' 

on  the  average,  there  are  (j-""p  )  snail  busy  periods  in  a  large  busy 
period.  In  Figure  A-l,  the  interval  (Tg,Ty  is  a  large  busy  period, 
while  time  intervals  (Tq,T^)  ,  (T^/Tj),  and  (T2,T3)  are  small  busy 
periods. 

In  order  to  prove  the  theorem,  let  us  refer  to  Figure  A-l. 

Assure  that  customer  A  starts  a  large  as  well  as  a  small  busy  period; 
customer  B  enters  the  service  box  at  and,  therefore,  starts  a 
small  busy  period  but  not  a  large  cne.  Thus,  customer  A  does  not  have 
to  wait  in  the  queue  box  while  customer  B  does.  After  they  enter  the 
service  box  (at  different  times) ,  customers  A  and  B  will  see  the  same 
environment  (FVQ/1  system  with  average  arrival  rate  A') .  There  is  no 
way  to  differentiate  these  two  customers  statistically  from  the  time 
they  enter  the  service  box  because  they  both  start  a  small  busy  period 
and  all  small  busy  periods  are  identically  distributed,  as  expressed  by 
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EJq.  (A. 5)  [19].  Thus,  for  customers  A  and  B,  the  tiiie  they  spend  in  the 
service  box  must  be  independent  of  the  time  they  spend  in  the  queue  box. 

Next,  let  us  look  at  customers  C  and  D  under  the  assumption  the 
distances  (Tc  -  T^)  and  (T^  -  T^)  are  the  sane,  that  is,  C  and  D 
enter  their  respective  small  busy  periods  at  the  same  corresponding 
time  (i.e. ,  the  service  box  has  been  busy  for  the  same  amount  of  time 
since  the  start  of  current  small  busy  period) .  Then,  as  far  as  the  tine 
spent  in  the  service  box  is  ccnoemed,  there  is  no  difference  between 
C  and  D  statistically  because  the  small  busy  periods  are  identically 
distributed,  but  their  waiting  times  in  the  queue  box  are  different  (as 
represented  by  their  respective  priorities  when  they  enter  the  service 
box)  as  depicted  by  Figure  A-l.  Therefore,  for  customers  C  and  D,  the 
theorem  holds  true;  but  C  and  D  can  be  customers,  so  the  independent 
assurrpticn  must  be  true  for  every  customer. 


Q.E.D. 


A.  2  Theorem  3.3  and  Its  Proof 

Theorem  3.3  The  Laplace  transform  Q*(s)  of  the  density  func¬ 
tion  of  the  waiting  time  spent  in  the  queue  box  by  a  customer  requiring 
t  seconds  of  service  time  (actually,  it  is  independent  of  t  as  we 
explained  earlier)  is 


Q*(s) 


(1  ~  p)  .  A,B*(s)  -  A1  +  s 
(1  -  p')  AB*(s)  -  A  +  s 


with  first  moment  equal  to 


„  _  £.im 
q  "  s-0 


3Q*(s) 

dS 


A? 

ra  -  P) 


A’? 

2TT-  p’) 
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Proof: 


I - 1 


CEXDMPOSmCN  OF  THE  SSA  SYSTEM 
FIGUFE  A- 2 


As  we  said  earlier,  the  scheduling  algorithm  in  the  service  box 
will  not  affect  the  waiting  time  distribution  as  long  as  no  feedback 
from  the  service  box  to  the  queue  box  is  possible.  For  convenience, 
let  us  assure  that  an  PCFS  scheduling  algorithm  is  being  used,  narrely , 
after  a  customer  enters  the  service  box,  he  will  be  served  on  an  PCFS 
basis  to  aoupletion.  This  makes  the  whole  system  PCFS.  The  Laplace 
transform,  S*(t,s) ,  of  the  equilibrium  response  time  distribution  for  an 
PCFS  system  is  well  known  [19] ,  namely 

s.(t,s)  =  B.(s)  .  (A.8) 

After  the  tagged  customer  enters  the  service  box,  he  is  in  an¬ 
other  PCFS  system  with  the  average  arrival  rate  A'  instead  of  A,  thus 
we  can  easily  get  Y*(t,s)  as 

Y*(t's)  -  B*(s>  wU  -’I'  7  s  <*•»> 

From  the  independent  property  proved  in  Theorem  3.2,  Q*(s)  can 
easily  be  obtained  as  the  ratio  of  S*(t,s)  and  Y*(t,s) 
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Q*(s) 


S*(t,s) 

V*(t,s) 


.  (1  -  P)  X'B*(s)  -  X‘  +  s 
■  Ti'  -  p’)  A  b*<s)  -  \  +  iT 


(A.  10) 


O.li.D. 
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Theorem  4.1  and  Its  Proof 

Theorem  4.1  The  average  response  time  T(t)  for  the  systan 
with  the  scheduling  algorithm  defined  by  g(t)  »  ge-^,  is  the  solution 
of  the  following  integro-differential  equation 

(1  -  p)T(t)  -  t  +  T'ftjje91  +  e91  -  l]  M/9dt 

-p/VdjIe^-e^tlf^dt 

Proof;  In  Section  2.2.9  we  discussed  the  attained  service  and  remarked 
that  the  density  of  customers  having  obtained  t  seconds  of  attained 
service  is  given  by  [30] 

n(t)  *  XT'(t)  [1  -  B(t)J  (B.l) 

Although  n(t)  is  not  available  to  us,  nevertheless,  we  can  use  it  as 
an  intermediate  step  lor  the  calculation  of  T(t) . 


g(t) 


FIGURE  B-l 


In  Figure  B-l  we  plot  g(t)  against  t,  where  g(t)  denotes 
the  relative  rate  of  attaining  service  for  customers  with  different 
amounts  of  attained  service  times.  What  we  are  interested  in  is  that 

Preceding  page  blank 
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given  an  elapsed  time  interval  T,  what  are  the  relative  aroints  of 
services  attained  by  customers  who  have  different  accunulated  aroints 
of  service  times  to  begin  with.  In  other  words,  we  wish  to  insider 
the  time  interval  during  which  a  customer  (let  us  call  him  customer  A) 
remains  in  the  system  and  gains  t  seconds  of  service.  During  this 
interval,  we  are  interested  in  hew  much  service  time  x  a  cus barer 
(let  us  call  him  customer  B)  can  get  during  this  same  time  interval 
given  that  customer  B  has  received  x  seconds  of  service  just  prior  to 
the  beginning  of  this  time  interval.  In  order  to  calculate  x,  let  us 
first  make  the  following  observation: 


=  f(y) 

dT(y) 


(B.2) 


dy  dz 

dT(y)  =  dT(  z) 


c 


(B.3) 


where  x  and  y  are  arbitrary  time  instants  on  the  time  axis  and  T(y) 
is  a  random  variable  denoting  the  response  time  for  getting  y  seconds 
of  service  time.  Equation  (B.2)  states  that  the  rate  of  gaining  service 
for  a  customer  with  y  seconds  of  attained  service  is  equal  to  f  (y) , 
the  fraction  of  the  total  service  facility  that  is  allocated  to  him. 
Equation  (B.3)  says  that  a  customer  with  y  seconds  of  service  gains 
service  at  a  rate  proportional  to  e"97;  and  a  cus toner  with  z  seconds 
of  attained  service  at  a  rate  proportional  to  e”92.  The  proportionally 
constant  is  given  as  c  which  does  not  depend  on  either  y  or  z. 
Equation  (B.3),  of  course,  just  restates  the  definition  of  the  algorithm 
as  defined  fcy  Eq.  (4.1)  and  is  a  direct  consequence  of  Bqs.  (4.2)  and 
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(B.2) .  Since  the  time  interval  during  which  customer  A  gains  t  sec 
ends  of  service  is  the  same  as  that  during  which  customer  B  gains  x 
seconds  of  service,  we  get  the  following  equation 


Substituting  Eq.  (B.3)  into  Eq.  (B.4),  we  get 


c  J'  e^dy  =  c  J“  e^zdz 


or 


.gt  .  1  =  eg(T-Hc)  _  egT 


Rearranging  Eq.  (B.6) ,  we  get 


(B,4) 


(B.5) 


(B.6) 


x  =  ilnte^te91  -  1)  +  1]  (B.7) 

Thus,  in  order  for  a  tagged  customer  to  get  t  seconds  of  ser¬ 
vice  time,  the  service  facility  has  to  serve  customers  who  are  already 
in  the  system  with  t  seconds  of  attained  service  when  the  tagged  cus¬ 
tomer  arrives  to  the  system  each  with  up  to  x  seconds  of  service. 

Since  the  density  of  customers  in  the  system  is  assumed  to  be  distributed 
as  n(x) ,  the  total  amount  of  work  U(t)  which  needs  to  be  done  to  them 
is  calculated  as 


n(x)  (t<J{]dx 

J 

n(x)  (Vu)  [1  -  e-MX]dx 
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(B.8) 


mJ  n(x)(l/y)|l  -  -  1)  +1]  ^g|dT 

where  t  is  the  average  anoint  of  service  tine  with  the  service  dis¬ 
tribution  truncated  at  x. 

U(t)  takes  care  of  all  the  customers  who  are  in  the  system 
prior  to  the  arrival  of  the  tagged  customer.  The  next  step  for  us  is 
to  calculate  the  anoint  of  work  V(t)  that  needs  to  be  done  to  those 
customers  who  arrive  to  the  system  later  than  the  tagged  customer  but 
before  he  leaves  the  system  with  his  t  seconds  of  service.  Let  us 
assure  that  one  of  those  later  comers  (let  us  call  him  custorer  C)  ar¬ 
rives  to  the  system  when  the  tagged  customer  (customer  A)  has  accumulated 
exactly  t  seconds  of  service.  We  wish  to  find  out  how  much  service 
w  that  customer  C  is  going  to  get  during  the  same  time  interval  that 
customer  A  gets  served  by  (t-r)  seconds.  Eq.  (B.4)  can  be  readily 
modified  as 


Rearranging  Eq.  (B.10) ,  w  can  be  calculated  as 

W  =  ilnle9*  -  e^  +  1] 


(B.9) 


(B.10) 


(B.ll) 
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w  is  the  maxirmm  amount  of  service  custcner  C  con  get  during  the  in¬ 
terval  when  the  tagged  custcner  aocunulates  his  attained  service  tine 
from  t  to  t  seconds.  The  actual  service  time  customer  C  gets  before 
he  leaves  the  system  is,  cn  the  average,  srraller  than  w.  It  can  easily 
be  calculated  for  exponentially  distributed  service  tines  as 

-  i<l  -  e'1")  (B.12) 

During  the  differential  time  interval  when  the  tagged  customer  gets  from 
t  to  t  +  dr  seconds  of  attained  service,  there  are,  on  the  average, 
T'  ( x) dT  (Little's  Result)  new  arrivals  coming  to  the  system,  there¬ 
fore,  V(t)  can  be  expressed  in  the  following  equation: 

V(t)  AT'(t)  ftjJdT 

=  /’t  AT'(t)  i  [1  -  e_pw]dT 

•'o  y 

= t)  |l  -  'egt  -  e91  +  l]"p/g|dT  (B.13) 

The  average  response  time  T(t)  (i.e. ,  the  total  average  time  the  tag¬ 
ged  customer  spends  in  the  system  in  order  to  get  served  for  t  seconds) 
is  the  sun  of  t,U(t)  and  V(t) .  From  Eq.  (B.7)  and  (B.13),  T(t)  is 
given  by 

T(t)  =  T  +  A(t)  +  B(t) 

-  t  +/”  n(T)i  |l  -  (e"gT(egt  -  1)  +  lr^l* 

Jo  v 
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(B.14) 


+  p^t'r,(T,|1  "  fe9t  -  e9T  +  irM/9|dr 

n(t)  is  given  by  Eq.  (B.l),  sibstituting  n(x)  into  Eq 
sinplifying,  we  get 

T(t)  -  t  +  i  f  n(T)dt  -  p  f  T'fxJe^Uf^e9*-  - 

0  *'o 

+  pT(t)  -  p  f t  T'  (t)  (e9*  -  e91  +  l]“M/9dr 
J0 

Por  Vi/tf/ 1  systems  (from  Bq.  (2.5))  the  average  nurber  in 
given  by 

/oo 

n(x)dx  =  — 

1  “  P 

Equation  (B.15)  beocmes 

T(t)  (1-0)  -  t  +  -  of"  T.  (T)e-MVTIegt  _  + 

M  •'o 

-  P  f  T'(x)  [e9*  -  e91*  l]’p/9dx 
0 

Finally,  we  get 


T(t)  (1 


-  p)  -  t  +  -  pf  r  (X)  [e9*  +  e91  -  1] 

0 

-  P  /*t  T'  (x)  (e9*  -  e91  +  lj^dx 

•'A 


Q.E.D. 


•  (B.14)  and 

1)  +  l]“p/gdx 

(B.15) 

the  system  is 

(B.16) 

e9T]-w/gdT 

(B.17) 

-^®dT 

(B.18) 
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APPENDIX  C 

THEOREMS  AND  PROOFS  FOR  THEOREMS  IN  CHAPTER  6 
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C.l  Theorem  6.1  and  Its  Proof 

Theorem  6.1  W(t)  is  a  nondecreasing  function  of  t  or 
equivalently 

W'(t)  =  >  o 

Proof:  We  are  considering  scheduling  disciplines  in  which  each  job  is 
characterized  by  (1)  its  attained  service  time,  t  and  (2)  its  elapsed 
waiting  time,  t^.  Therefore,  the  state  of  the  system  is  the  nurrber  of 
jobs  in  the  system  and  t  and  for  each  job.  A  particular  sched- 
using  discipline  nay  effectively  ignore  one  or  both  of  these  parameters, 
but  this  information  is  assured  to  be  available  for  each  job.  Because 
scheduling  decisions  are  made  only  on  the  basis  of  these  two  parameters, 
the  following  statement  is  self-evident.  The  history  of  a  job  requiring 
^  _>  t  seconds  of  service  from  the  time  of  its  arrival  at  the  system 
until  it  has  received  t  seconds  of  service  is  independent  of  the 
exact  value  of  t^.  A  direct  consequence  of  this  fact  is  that  W(t)  is 
a  nondecreasing  function  or  equivalently 

W’(t)  =  >  0  (C.l) 

Q.E.  D. 

C.2  Theorem  6.2  and  Its  Proof 

Theorem  6.2  There  is  a  conservation  lav  thatW(t)  has  to 
satisfy,  namely 

~2 

J0  WttHl-Bttlldt.jjfVpJ- 

For  T(t) ,  the  conservation  law  becomes 

Preceding  page  blank 
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/  waa-BWidt-jg^. 

Proof:  From  [30)  we  have  that 

n(t)  =  All  -  B(t) )  [W*  (t)  +  1)  (C.2) 

where  n(t)  is  the  density  of  jobs  in  the  system  with  t  seconds  of 
attained  service  time.  We  define  the  "work"  in  the  system  at  time  t 
as  the  additional  time  required  to  empty  the  system  if  no  new  arrivals 
are  permitted  entry;  this  is  also  referred  to  as  the  "unfinished  work" 
and  as  the  "virtual  waiting  time."  The  mean  work  W  in  the  system  can 
be  expressed  as 

w  =  j  n(t)E  [remaining  service  time  for  a  job  with  attained 
service  time  of  t]dt 

or 

' «  r?wdt 

Substituting  from  (C.  2) 


w  =  A  r (W'(t)  +  1)  /"( t  -  t) dB ( t) dt 

J0 

By  changing  the  order  of  integration 


(W’(t)  +  1)(T  -  t)dt 
Integrating  the  inner  integral  by  parts 


c3B(t) 


(C.3) 


f  (W'(t)  +  1)  (T  -  t)dt  =  (T  -  t)  (W(t)  +  t) 

yo 

/■T  1 
*  /  [W(t)  +  tjdt 

y0 

Substituting  into  Eq.  (C.3) 


+  /Iwit,  +  t)  dt 
•/0 


134 


W  =  A  f  f  [W(t)  +  t]dt  dB  (t) 

•'o  •'o 


Again  changing  the  order  of  integration 


/OO  oo 

IW(t)  +  t]  /  <ffi(T)dt 


But  integrating  by  parts,  we  have  that 


r 


t[l  -  B(t)  ]dt 


7 


Moreover,  the  mean  work  in  the  system  is  known 


W  = 


X? 

2(1  -  p) 


(C.4) 


Thus  we  have  the  following  conservation  laws  for  T(t)  and  W(t)  ; 


t 

pT 


=  f  T(t)  (1  -  B(t)  ]dt 

■'o 


and 


2 

r^fc— y  =  J  «(t)  [1  -  B(t)  ]dt 


(C.5) 


(C.6) 


C.3  Theorem  6.3  and  Its  Proof 


Q.E.D. 


Theorem  6.3  The  lower  bound  W^(t)  of  waiting  time  functions 
is  given  by  the  waiting  time  for  the  PCI'S  discipline  with  the  service 
time  distribution  truncated  at  t,  namely 


At 


Wfl(t)  = 


<t 


2<x  ‘  P<tJ 


Note  that  W£(0)  =  0  and  that  W£(»)  =  (the  average  waiting  time  for 

the  PCFS  system  as  given  in  Section  2.2.1) ,  also  W'^(O)  *  W'^(®)  =  0. 
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Proof;  We  claim  that  to  minimize  W(x)  the  scheduling  discipline  must 

1.  never  service  jabs  with  attained  service  time  greater 
than  or  equal  to  x  while  there  are  jobs  in  the  system 
with  attained  service  time  less  than  x,  and 

2.  never  preempt  a  job  once  it  has  been  selected  for  service 
until  it  has  at  least  x  seconds  of  attained  service 
time, 

Under  these  conditions  the  response  function  in  the  interval  (0,x)  is 
just  the  response  function  for  a  ncnpreonptive  system  with  service  times 
truncated  at  x.  Ebr  convenience  we  will  assume  a  FCFS  scheduling  dis¬ 
cipline.  In  this  case  the  response  function  (denoted  wpxrs-x^  has 
the  form  shown  in  Figure  C-l  (see  Section  2.2.9) .  Note  that 
WrcFS-x(t)  =  ®  over  (0,x) .  The  scheduling  of  jobs  with  attained  service 
time  greater  than  x  is  of  no  concern  in  this  argument  as  long  as  con¬ 
dition  1  is  maintained. 

W(t) 


RESPONSE  FOR  FCFS  UP  TO  X  SECONDS  OF  SERVICE 
FIGURE  C-l 
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Let  Wx  be  the  mean  work  in  the  system  excluding  work  to  be 
done  on  jobs  beyond  providing  x  seconds  of  attained  service  to  each. 

In  other  words,  if  a  job  requires  t  >  x  seconds  of  service  and  has 
received  y  <  x  seconds  of  service,  its  contribution  to  Wx  is 
x  -  y.  By  the  same  method  used  to  derive  Eq.  (C.5)  it  can  be  shown  that 


Wx  =  *^X[W(t)  +  t]  [1  -  B ( t)  ] dt 


Now  since  Wj^g^tt)  has  minimum  slope  (i.e. ,  0)  only  over  the  inter¬ 
val  (0,x) ,  and  due  to  the  monotcnicity  given  in  Eq.  (C.l) ,  if  any 
other  response  curve  W(t)  is  such  that  W(x)  <  (x)  it  must  be 

such  that  W(t)  <  f°r  0  <  t  <  x.  But  under  condition  1  above, 

Wx  has  its  miniimxn  value  since  work  in  this  class  is  continuously  de¬ 
creased  at  maximum  rate  whenever  there  is  such  vork  in  the  system. 
Therefore,  for  any  W(t) , 


\f [W(t)  +  tj  [1  -  B(t)dt 


+  tm -■(«]* 


Thus  we  conclude  that  W(t)  <  Wp^pS_x^t^  in  (0,x)  is  impossible  and 
therefore  W(x)  >  Wj^^fx) . 

The  lower  bound  Wfc(t)  is  given  by  the  waiting  time  for  the 
PCFS  discipline  with  the  service  times  truncated  at  t,  namely 


Mt<«  - 


X? 


<t 


2<1  "  P<t) 


(C.7) 


Q.E.D. 
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C.4 


Theorem  6.4  and  Its  Proof 


Ihaoran  6.4  The  upper  bound  W^(  t)  of  waiting  time  functions 
is  given  as 


X? 

V»  = 


tp 


<t' 


T'~ 


<t 


<t 


Qie  scheduling  discipline  which  gives  Wu(t)  is  a  two-level  systan 
with  both  levels  served  FCFS  and  switching  point  (see  Chapter  6)  at  t. 
Note  that  Wu(0)  =  W£(«)  -  that  W^(0)  =0  and  that  W^(»)  >  jJL-. 

Proof:  In  this  case  we  begin  with  a  discrete  time  systan. 

Assune  that  the  service  time  distribution  is  of  the  form 

Pr (service  time  *  kg]  =  k  =  1,2,3,... 

where  q  is  the  quantun  size.  Therefore,  the  only  possible  service 
time  requirements  are  multiples  of  q.  We  shall  also  assune  that  arri¬ 
vals  may  take  place  only  during  the  instant  before  the  end  of  a  quantun 
and  that  the  processor  is  assigned  to  a  job  for  a  quantun  at  a  time. 

The  probability  that  an  arrival  takes  place  at  the  end  of  a  quantun  is 
Xq  so  that  the  mean  arrival  rate  is  X.  It  should  be  clear  that  any 
continuous  service  time  distribution  can  be  approximated  arbitrarily 
closely  by  a  discrete  time  distribution  by  letting  q  approach  0. 

Also,  these  restrictions  on  the  service  discipline  and  arrival  mechanism 
are  effectively  eliminated  when  q  -*>  0.  In  this  discrete  time  model  our 
goal  is  to  maximize  W(kq) . 

We  claim  that  the  following  scheduling  rule  is  necessary  and 
sufficient  to  maximize  W(kq) :  no  allocation  of  a  kth  quantun  is  made 
bo  any  job  when  there  is  some  other  job  in  the  systan  waiting  for  its 


138 


its  jth  quantun  where  j  ?  k.  We  note  in  passing  that  many  scheduling 
disciplines  will  satisfy  this  rule. 

We  relabel  the  time  axis  so  that  t  =  0  at  an  arbitrary  jroint 

in  sate  idle  period.  The  times  at  which  some  job  is  allocated  to  a  kth 

♦>!> 

quantun  we  call  "critical  times."  Let  c^  be  the  time  that  the  i 
critical  time  occurs.  We  wish  to  maximize  c£  (the  average  of  c£)  for 
some  fixed  £,  and  we  will  show  that  to  acocrrplish  this  it  is  necessary 
and  sufficient  to  satisfy  the  condition  that  at  the  1th  critical  time 
no  job  is  waiting  for  a  j**1  quantum  where  j  /  k.  Certainly  this  con¬ 
dition  is  necessary  since  if  a  proposed  scheduling  discipline  did  not 
have  this  property  then  c£  can  easily  be  increased  when  the  condition 
is  not  satisfied  as  follows:  follow  the  proposed  schedule  uitii  the 
point  where  the  4  th  critical  time  would  occur  and  then  assign  a  quantun 
to  a  job  waiting  for  its  j**1  k)  quantun. 

Since  we  have  already  shewn  necessity,  to  prove  the  sufficiency 
of  the  condition  for  maximizing  c£,  we  need  only  show  that  any  schedule 
satisfying  the  condition  yields  the  same  value  for  c£.  Let  A  be  any 
scheduling  algorithm  which  satisfies  the  rule  that  at  the  1th  critical 

U. 

time  no  job  is  waiting  for  a  j  quantum  where  j  ^  k.  Let  a£  be  the 
time  at  which  the  £  job  arrives  which  will  require  at  least  kq 
seconds  of  service.  The  state  of  the  system  at  a£  will,  in  general, 
depend  on  the  algorithm  A.  In  particular,  the  nurber  of  critical  times 
that  have  occurred  prior  to  a£  (let  this  be  s)  is  a  function  of  A. 
Let  Ea(c£  -  a£|  state  of  systems  at  a£]  be  the  expected  value  of 
c£  -  a£  under  algorithm  A  conditioned  on  the  state  of  the  system  at 
a£.  The  state  of  the  system  is  given  by  the  nurber  of  jobs  in  the 
system,  the  attained  service  time  of  each  job  in  the  system  and  s,  the 
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nunber  of  critical  tires  that  have  occurred.  Thus  we  have 
EA^c£  “  a£  I state  of  system  at  a£] 


*  [remaining  work  in  system  not  requiring  a  k^1 
quantun|  state  of  system  at  a£] 

+  (i  -  s  -  1)E [remaining  service  tire  for  job 

with  (k  -  l)q  seconds  of  attained 
service] 

+  (k  -  l)q 

+  At<(k  -  l)qEAIc£  "  a£ I  state  of  the  system  at  a£]  (c.8) 


But  the  sum  of  the  first  two  terns  on  the  righthand  side  of  this  equa¬ 
tion  is  equal  to  the  expected  amount  of  work  in  the  system  at  a£  given 
the  state  at  a£.  Thus 


EAIC£  "  a£  I  state  of  system  at  a£] 

=  EA[work  in  system  at  a£  state  at  a£] 

+  (k  -  l)q 

+  At<(k  -  l)qEAIC£  "  a£  I  state  of  system  at  a£] 
Hsmoving  the  condition  on  the  state  of  the  system  at  a£  we  have 
EA^c£  “  a£^  =  in  the  system  at  a£] 


or 


+  (k  '  1)q  +  At<(X  -  l)qEA>Ct  -  V 


p  ,  EA[work  1x1  ^stsn  at  a.]  +  (k  -  1  )q 

EAtC£  ~  a£]  * - - - — - 


1  -  At 


<(k  -  l)q 
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But  EA[work  in  system  at  a£]  is  not  a  function  of  the  particular 
scheduling  algorithm  and  therefore  EA(c£  -  a£]  does  not  depend  on  A. 
Sinoe  Elc£]  =  E[c  -  a£]  +  Ela£]  and  the  ri^it-hand  side  is  independent 
of  A,  E{c£]  is  independent  of  A.  Note  that  the  form  of  Eq.  (C.8) 
depended  on  A  having  the  property  that  at  c£  there  are  no  jcbs  in 
the  system  waiting  for  a  j  quantun  where  j  5*  k.  We  have  now  shown 
that  this  condition  is  necessary  and  sufficient  to  maximize  E[cJ  (=  c0) . 

We  now  show  that  the  general  scheduling  rule  to  maximize  W(kq) 
is  the  same  rule  which  maximizes  c£  applied  for  all  £.  We  have 

n  n 

_  £a£ 

W(kq)  =  lim  ^ - -  -  (C.9) 

n-*» 

The  a£  are  independent  of  the  scheduling  discipline  and  the  proposed 
scheduling  rule  is  necessary  and  sufficient  to  individually  maximize 
the  c£.  Therefore,  the  same  role  is  necessary  and  sufficient  to  maxi¬ 
mize  W(kq) ,  which  establishes  our  earlier  claim. 

It  should  be  clear  that  in  a  continuous  time  system  we  can  ap¬ 
proach  the  maximun  of  W(x)  by  the  following  role:  no  job  with  attain¬ 
ed  service  time  in  the  open  interval  (x  -  e,x)  (for  e  >  0)  is  serviced 
while  there  is  a  job  waiting  for  service  which  has  attained  service  time 
outside  this  interval.  By  permitting  e  to  shrink  to  zero,  we  approach 
the  maximun  for  W(x) . 

One  scheduling  discipline  which  maximizes  W(x)  is  the  two-level 
system  in  which  jobs  are  served  POPS  in  the  first  level  up  to  x“ 
seconds  of  attained  service.  A  job  which  does  not  finish  is  placed  in 
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the  second  level  queue.  Hie  second  queue  is  serviced  PCFS  to  carpi et ion. 
The  second  queue  has  a  lower  priority  and  is  only  serviced  when  the 
first  queue  is  erpty  (see  the  ML  systems  described  in  Section  3) .  This 
queueing  system  satisfies  the  condition  for  maximizing  W(x)  and  there¬ 
fore  from  Eq.  (2.57)  we  have 

wu(t)  =  2(1  -  p*t) 7i  -  o) +  (c-10» 


Q.E.D. 


C.5  Theorem  6.5  and  Its  Proof 

Theorem  6.5  For  a  response  time  function  W(t)  which  is  con¬ 
tinuously  differentiable,  W'(t)  =  cannot  be  monotonically  non¬ 

decreasing  with  t. 

Proof:  I^t  us  prove  this  theorem  by  contradiction.  Suppose  that  we 

can  find  aW'(t)  which  is  monotonically  non-decreasing  with  t,  then 

W(t)  either  does  not  intersect  W^(t)  which  represents  the  waiting 

time  function  for  the  RR  system;  or  it  intersects  WDD(t)  at  only  one 

HR 

point  as  shown  in  Figure  C-2.  If  W(t)  does  not  intersect  with  W  (t) 
then  it  either  lies  entirely  above  W(t)  or  entirely  below  W(t)  , 
neither  of  these  two  situations  is  possible  because  they  violate  the 
conservation  law  as  depicted  by  Eq.  (C.5) . 

Oi  the  other  hand,  if  W(t)  crosses  W^ft)  at  t ^  since  W(t) 
is  continuously  differentiable  and  non-decreasing,  W'(t)  has  to  be 
larger  than  W'pp(t)  for  all  t  >  t^.  But  in  Theorem  6.4,  we  prove  that 
W^W  =  W’p^t)  =  j-  P  p,  thus  we  have  W'(t)  >  W^H  for  all  t  >  t.. 

It  means  that  W(t)  increases  at  a  faster  rate  than  the  upper  bound  for 
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all  waiting  time  functions.  Sooner  or  later,  W(t)  will  intersect 
Wu(t)  and  then  assumes  larger  value  than  Wu(t) .  This,  of  course, 
violates  the  definition  of  the  ipper  bound. 


FIGUFE  02 


Q.E .  D. 
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